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Methods and kits for diagnosing and treating B-Cell Chronic Lymphocytic 
Leukemia (B-CLL) 

All patent and non-patent references cited in the present application, are hereby 
incorporated by reference in their entirety. 

5 Field of invention 

The present invention relates to methods and kits for detecting several polynucleotide 
sequence found to be indicative of a poor prognosis of B-CLL. All the polynucleotides are 
transcribed from a region on human chromosome 12p21-22. Most of the polynucleotides 
do not encode larger polypeptides, but may encode small peptides, they may function as 
10 RNAs. Four polynucleotides encode a novel protein, which in one preferred embodiment 
can be used as a cytokine, preferably as an interleukin. Furthermore the invention relates 
to methods and compositions for treating B-CLL in particular poor prognosis B-CLL. 



Background of invention 

B-CLL is the most common form of leukaemia in Denmark, with more than 250 new cases 
15 diagnosed every year. The disease results in accumulation of CD19+CD5+CD23+ 

lymphocytes in the blood, bone marrow and organs of the patients. B-CLL cells are long- 
lived, slowly dividing and locked in the G, phase of the cell cycle. At this time it is unknown 
how or why B-CLL occurs and no cure is known for B-CLL. The application of more 
aggressive treatment strategies has been hampered by the inability to identify 
20 reproducible and reliable prognostic predictors in patients with poor outcome in this 
disease. In many patients the diagnosis does not affect morbidity or mortality. Other 
patients suffer from an incurable cancer that inevitably results in death, regardless of 
treatment. Until recently this latter group of patients could not be identified at the time of 
diagnosis. Recently, two studies established the mutational status of immunoglobulin 
25 variable region of the heavy chain (Ig V H ) genes in B-CLL as independent prognostic 
markers, within each clinical stage (Damle, et al. & Hamblin, et al.). Patients without 
somatic hypermutation show much shorter survival than patients with somatic 
hypermutation. FISH-studies of cytogenetic aberrations in B-CLL established specific 
abnormalities on chromosomes 11 (ATM), 12 (?), 13 (Leu-1 and-2) and 17 (p53) as 
30 independent prognostic markers, within each clinical stage (Dohner, et al.). Very recent 
studies have demonstrated that independent risk prediction, using a combined analysis of 
Ig V H gene mutational analysis and cytogenetics, can identify subgroups of B-CLL with 
median survivals ranging from less than 2.5 years to more than 15 years (Krober, et al., 
Lin, et al., & Oscier, et al.) (see Figure 1). Since the process of characterising the Ig VH 
35 gene mutational status of an individual patient is cumbersome, it is desirable to provide 
easier tests based on diagnostic markers for use in the differential diagnosis of such cancer 
patients. 
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Summary of invention 

It is an object of preferred embodiments of the present invention to provide differentially 
expressed transcription products, which can be used as prognostic markers of disease and 
give information about the differences in etiology between different groups of B-CLL 
5 patients. These differentially expressed transcription products are genetic markers that can 
be used in an easy assay to distinguish between subgroups of B-CLL patients and 
especially identify B-CLL patients with a poor prognosis. 

This method for diagnosing a subtype of B-cell chronic lymphocytic leukaemia (B-CLL) 
10 comprises the steps of determining the presence or absence of at least one expression 
product such as a transcriptional product which comprise a nucleotide sequence selected 
from the group consisting of SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, 
SEQ ID No: 16, SEQ ID No: 17 and SEQ ID No: 18 in a biological sample isolated from a 
individual. As evidenced by the appended examples, the present inventors have 
15 determined that the expression products of this invention are present in one subtype of B- 
CLL having poor prognosis and thus of great diagnostic value and independent prognostic 
value. Equally important, an expression product comprising a nucleotide sequence selected 
from the group consisting of SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, 
SEQ ID No: 16, SEQ ID No: 17 and SEQ ID No: 18 has not been found in any of the other 
20 tissue types tested (see e.g. Figure 8). 

The vast majority of patients which show expression of the AMB-1 gene in form of at least 
one of the expression products selected from the group consisting of SEQ ID No: 12, SEQ 
ID No: 13, SEQ ID No: 14, SEQ ID No: 15, SEQ ID No: 16, SEQ ID No: 17 and SEQ ID No: 18 

25 show unmutated Ig V(H) genes which is consistent with poor prognosis B-CLL. The 
presence of an expression product of the AMB-1 gene can be determined easily using 
standard laboratory procedures and equipment. Therefore the diagnostic method provided 
by the present inventors provides an easy method of diagnosis as compared to the 
determination of the mutation status of Ig V(H) genes and can furthermore give additional 

30 information about the prognosis. 

Accordingly, a further object of preferred embodiments of the present invention is a 
method for determining the progress of B-CLL comprising determining the amount of at 
least one expression product which comprise a nucleotide sequence selected from the 
35 group consisting of SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, SEQ ID 
No: 16, SEQ ID No: 17 and SEQ ID No: 18 in a biological sample isolated from an individual. 
The method may be used e.g. for determining the efficiency of a treatment, i.e. to see 
whether the amount of the expression product decreases or increases in response to a 
curative treatment. 

40 

The expression products of the present invention are all transcripts of SEQ ID NO:l and/or 
SEQ ID No:5, the gene of the present invention called AMB-1 which also encodes a novel 
polypeptide (SEQ ID NO: 3). 
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A further object of preferred embodiments of the present invention is to provide a cure 
and/or treatment of patients with B-CLL, in particular of patients with poor prognosis B- 
CLL such as the sub-type of B-CLL which is characterised by the presence of an expression 
product of the present invention. 

5 

The method for treating B-CLL comprises administering to an individuai with a B-CLL 
diagnosis a compound capable of decreasing or inhibiting the formation of an expression 
product of SEQ ID NO:l and/or SEQ ID NO:5. This expression product preferably 
comprises a nucleotide sequence selected from the group consisting of SEQ ID No: 12, SEQ 
10 ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18. 
The present inventors believe that the presence of at least one of said expression products 
is an etiological factor in B-CLL and that the disease can be treated or cured by inhibiting 
the expression of at least one of such products and/or by inhibiting the effect of such 
product by e.g. rendering it inactive. 

15 

A further preferred object of embodiments of the present invention is to destroy or to 
eliminate the transcription of at least one expression product comprising at least one 
nucleotide sequence selected form the group consisting of SEQ ID No: 12, SEQ ID No: 13, 
SEQ ID No: 14, SEQ ID No: 15, SEQ ID No: 16, SEQ ID No: 17 and SEQ ID No: 18 for the 
20 treatment of cancer, such as a poor prognosis sub-type of B-CLL. 

The above destruction or elimination is obtained by applying polynucleotides or 
oligonucleotides in the form of small interfering RNA molecules (siRNA), antisense 
molecules or ribozymes. 

25 

In one aspect the invention relates to a gene therapy vector capable of inhibiting or 
decreasing the formation of an expression product of SEQ ID NO:l and/or SEQ ID NO:5, 
said gene therapy vector preferably encoding a specific si RNA molecule, a specific 
antisense molecule or a specific ribozyme being capable of decreasing or inhibiting the 
30 formation of an expression product of SEQ ID NO:l and/or SEQ ID NO: 5. This gene 

therapy vector can be used for treating B-CLL based on the finding that the AMB-1 gene 
encoded by SEQ ID No:l and/or SEQ ID No:5 is an etiological factor in B-CLL. 

Both SEQ ID No 1 which is a 20,000 nucleotide long sequence and SEQ ID No 5 which is a 
35 80,000 nucleotide long sequence provides several transcriptional products in B-CLL cells in 
patients with poor prognosis B-CLL. Some of the transcriptional products e.g. SEQ ID No 2 
and SEQ ID No 4 consists of two exons (SEQ ID No: 15 and SEQ ID No: 16) separated by 
the same intron. Both mRNA sequences encode an open reading frame (SEQ ID No: 17) 
encoding a 121 amino acid peptide (SEQ ID No 3). 

40 

Accordingly, yet another object of preferred embodiments of the present invention relates 
to a novel class of polypeptides. These may be described as a group of isolated 
polypeptides or proteins comprising or essentially consisting of the amino acid sequence of 
SEQ ID No. 3, or a fragment thereof, or a polypeptide functionally equivalent to SEQ ID 
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No. 3, or a fragment thereof, wherein said fragment or functionally equivalent polypeptide 
has at least 60% sequence identity with the polypeptide of SEQ ID No 3. The polypeptides 
of the present invention may have interleukin or cytokine activity. 

5 In a still further aspect the invention relates to an isolated polynucleotide selected from 
the group consisting of: 

i) a polynucleotide comprising nucleotides of SEQ ID No 5, 

ii) a polynucleotide encoding a polypeptide having the amino acid sequence of SEQ ID No 
10 3, 

iii) a polynucleotide, the complementary strand of which hybridises, under stringent 
conditions, with a polynucleotide as defined in any of i) and ii). 

iv) a polynucleotide which is degenerate to the polynucleotide of iii), and 

v) the complementary strand of any such polynucleotide. 

15 

The polypeptides encoded by the polynucleotides may furthermore 

a) have at least 60 % sequence identity with the amino acid sequence of SEQ ID No 3 
and have interleukin or cytokine activity, 

b) be recognised by an antibody, or a binding fragment thereof, which is capable of 

20 recognising an epitope, wherein said epitope is comprised. within a polypeptide having 
the amino acid sequence of SEQ ID No 3; and/or 

c) be competing with a polypeptide having the amino acid sequence as shown in SEQ ID 
No 3 for binding to at least one predetermined binding partner such as a cytokine 
receptor. 

25 

One further therapeutic application of the present invention is a method of vaccination 
against B-CLL said method comprising immunising an individual against a translational 
product of SEQ ID No:l and/or SEQ ID No: 5. By stimulating the immune system of an 
individual to produce antibodies against the translational product the individual can 
30 become immune towards B-CLL and/or the method can be used as part of therapy. The 
state of the art describes various ways of immunising an individual against a particular 
protein. 

Finally, the invention provides a method for determining an increased or decreased 
35 predisposition for B-CLL comprising determining in a biological sample from an individual a 
germline alteration in a target nucleic acid sequence comprising 150,000 nucleotides, said 
target nucleic acid sequence comprising at least 10 nucleotides of SEQ ID No:l and/or SEQ 
ID No:5. This aspect is based on the finding of the importance of the expression product of 
SEQ ID No:l and/or SEQ ID No:5, and the absence of any detectable expression product 
40 of SEQ ID No:l and/or SEQ ID No: 5 in healthy tissue and in patients with good prognosis 
B-CLL. It is highly likely that the difference is caused by a germline alteration. A germline 
alteration can be targeted by gene therapy methods and by the methods provided in the 
present invention. 
Description of Drawings 
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Figure 1: Overall survival of B-CLL patients by genotype (all stages) The prognostic 
significance of V H homology and cytogenetic aberrations is independent of clinical stage 
(from Krober et aL, 2002 (4)). 

5 Figure 2 (a-d): Survival curves for survival or time to progression based on AMB-1 

expression or IgV H mutational status respectively. Patients are newly diagnosed, untreated 
B-CLL patients (n=34). 

Figure 3: Northern blotting demonstrating the expression level of AMB-1 in various tissue 
10 samples using as a probe an 875 base pair fragment of Exon 3/Seq ID No: 16. UPN 1, 4 
and 7 are unmutated B-CLL patients, UPN 19, 9, 10, 13 and 21 are mutated B-CLL 
patients. Included are RNA samples from normal Colon, Spleen, Bone Marrow and PBL 
(peripheral blood lymphocytes) and RNA from the Ramos and Granta cell lines. Equal 
loading of lanes was confirmed by re-probing with an actin probe (results not shown). 

15 

Figure 4. Alignment of AMB1 with IL4 based on structurel similarity. IL4 is called dliara in 
the alignment. The additional lines indicate the structural similarity. 

Figure 5. A 3D search, where the peptide sequence has been searched for similarity to 
20 known protein or peptide 3D-structures. 

Figure 6. Predicted 3-D structure of AMB-1 compared to the known 3-D structure of human 
IL4. Prediction is performed using SEQ ID No: 3 and the method described in: Enhanced 
Genome Annotation using Structural Profiles in the Program 3D-PSSM. Kelley LA, 
25 MacCallum RM & Sternberg MJE (2000). 3. Mol. Biol. 299(2), 499-520. 

Figure 7. Alignment of the AMB1 peptide sequence with the sequences of IL4, IL3, IL13 
and GM-CSF, based on their structures. 

30 Figure 8. A table showing the tissue types on the MTE array used for dot blotting of AMB-1 
to check for expression in other tissue types. 

Figure 9. Schematic representation of the transcriptional products of the present invention 
compared to the genomic AMB1 sequence (1 & 5). 2 is mRNA short form (SEQ ID No 2). 4 
35 is mRNA long form (SEQ ID No 4). 6 (SEQ ID No:6), 7 (SEQ ID No:7), 8 (SEQ ID No:8), 9 
(SEQ ID No:9), 10 (SEQ ID No: 10) and 11 (SEQ ID No: 11) are alternative transcription 
products all comprising at least one nucleotide sequence selected form the group 
consisting of SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, SEQ ID No: 16, 
SEQ ID No:17 and SEQ ID No:18. 

40 

Figure 10. RT-PCR demonstrating the expression of AMB-1 in B-CLL patients by RT-PCR. 
UPN1-UPN8 are unmutated patients, UPN9-UPN16 are mutated patients. 



WO 2004/046376 



PCT/DK2003/000794 



6 

Detailed description of the invention 

The present invention in particular relates to polynucleotide sequences found to be 
indicative of a poor prognisis of B-CLL and new methods and compositions for treating B- 
CLL in particular poor prognosis of B-CLL. An overview of the sequences disclosed by the 
5 present invention is present here: 

SEQ ID No:l includes 20.000 bp human genome sequence, derived from BAC clone acc. 
no. AC063949. It Includes the region encoding the mRNAs corresponding to the cDNAs 
described in Seq ID Nos:2, 4, 6, 7, 8, 9, 10, and 11 and possible up- and down-stream 
10 regulatory sequences. Seq ID No:l includes a subset of the sequence described in Seq ID 
No: 5. The sequence is derived from human 12q21-22. 

SEQ ID No:2 includes a cDNA corresponding to a putative mRNA transcript that includes 
the region encoding the peptide sequence in Seq ID No: 3 and up- and downstream 
15 regions. It is transcribed from the +strand of Seq ID No:l and Seq ID No: 5 from human 
12q21-22. From position 2317, this sequence is identical to Seq ID No:4. 

Seq ID No:3 contains a peptide sequence encoded by some mRNAs transcribed from the 
region on human chromosome 12q21-22 included in Seq ID No:l and Seq ID No:5. It is 
20 encoded by the mRNA sequences identified as cDNAs in Seq ID Nos 02, 04, 09 and 11. 

Seq ID No:4 includes a cDNA corresponding to a putative mRNA transcript that includes 
the region encoding the peptide sequence in Seq ID No: 3 and up- and downstream 
regions. It is transcribed from the +strand of Seq ID No:l and Seq ID No: 5 from human 
25 12q21-22. 

Seq ID No:5 includes 80.000 bp human genome sequence, derived from BAC clone acc. 
no. AC063949. It is an expansion of the genomic sequence included in Seq ID No:l. which 
is contained within this sequence. It includes the region encoding the mRNAs 
30 corresponding to the cDNAs described in Seq ID Nos 02, 04, 06, 07, 08, 09, 10 and 11 and 
possible up- and down-stream regulatory sequences. The sequence is derived from human 
12q21-22. 

Seq ID No:6 corresponds to a cDNA detected by cDNA cloning, corresponding to an mRNA 
35 transcript. It includes two exons. It is transcribed from the +strand of Seq ID No:l and 
Seq ID No:5 from human 12q21-22. 

Seq ID No:7 corresponds to a cDNA detected by cDNA cloning, corresponding to an mRNA 
transcript. It includes three exons, the first and third are identical to the two exons in Seq 
40 ID No:6. It is transcribed from the +strand of Seq ID No:l and Seq ID No:5.doc from 
human 12q21-22. 

Seq ID No:8 corresponds to a cDNA detected by cDNA cloning, corresponding to an mRNA 
transcript. It includes two exons, the last is also present as exon-2 in a human cDNA clone 
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(sequence acc. no. BC036936) (Seq ID No: 9.doc). It is transcribed from the -fstrand of 
Seq ID No:5.doc from human 12q21-22. 

Seq ID No:9 corresponds to a human cDNA sequence (sequence acc. no. BC036936). It is 
5 transcribed from the -fstrand Seq ID No:5 from 12q21-22. We have not cDNA cloned this 
cDNA, but a splice variant (Seq ID No:8), where exon-2 of this sequence was spliced to 
exon-1 of Seq ID No:6 was detected by cDNA cloning. It is transcribed from the +strand of 
Seq ID No: 5 from human 12q21-22. 

10 Seq ID No: 10 corresponds to a cDNA detected partly by cDNA cloning, partly by PCR 
analysis, corresponding to an mRNA transcript. It includes two exons, exon-1 includes the 
region encoding Seq ID No:3 and exon-1 from Seq ID No:6; exon-2 is identical to exon-2 
in Seq ID No:6 and exon-3 in Seq ID No:7. It is transcribed from the +strand of Seq ID 
No:l.doc and Seq ID No:5 from human 12q21-22. 

15 

Seq ID No:ll corresponds to a cDNA detected by cDNA cloning, corresponding to an 
mRNA transcript. It includes one exon. The sequence includes the the region that encodes 
Seq ID No:3, exons 2 and 3 from Seq ID No:7 and the region between those exons. It is 
transcribed from the +strand of Seq ID No:l and Seq ID No: 5 from human 12q21-22. 

20 

Seq ID No: 12 Is an exon sequence. It corresponds to the first exon in Seq ID No: 9 It is 
transcribed from the +strand of Seq ID No:5 from human 12q21-22. 

Seq ID No: 13 Is an exon sequence. It corresponds to the first exon in Seq ID No: 6, 7, 8, 
25 and It is included in Seq ID No: 2. It is transcribed from the +strand of Seq ID No:l and 5 
from human 12q21-22. 

Seq ID No:14 Is an exon sequence. It is identical to Seq ID No: 13, but with an additional 
GT dinucleotide at the 3'end, caused by the use of an alternative splice site. It can replace 
30 Seq ID No: 13 as the first exon in Seq ID No: 6, 7, 8, and be included in Seq ID No: 2. It is 
transcribed from the +strand of Seq ID No:l and 5 from human 12q21-22. 

Seq ID No: 15 Is an exon sequence. It corresponds to the second exon in Seq ID No: 7 
and it is included in Seq ID No: 2, 4, 10 and 11. It is transcribed from the + strand of Seq 
35 ID No:l and 5 from human 12q21-22. 

Seq ID No: 16 Is an exon sequence. It corresponds to the third exon in Seq ID No:7, it is 
the second exon in Seq ID No:6 and 11 and it is included in Seq ID No:2 and 4. It is 
transcribed from the +strand of Seq ID No: 5 from human 12q21-22. 

40 

Seq ID No:17 Is the sequence encoding the peptide in Seq ID No: 3. It is included in Seq 
ID No:2, 4, 10 and 11. It is transcribed from the +strand of Seq ID No:5 from human 
12q21-22. 



WO 2004/046376 



PCT7DK2003/000794 



8 

Seq ID No:18 Is an exon sequence. It corresponds to the second exon in Seq ID No:8 
and 9. It is transcribed from the +strand of Seq ID No:5 from human 12q21-22. 



5 Methods of diagnosis 

One important aspect of the present invention relates to diagnosis of a subtype of B-cell 
chronic lymphocytic leukaemia (B-CLL) having poor prognosis. A further important aspect 
of the invention relates to prognosis of B-CLL. These methods are based on the discovery 
by the present inventors that an expression product which comprise at least one nucleotide 

10 sequence selected from the group consisting of SEQ ID No: 12, SEQ ID No: 13, SEQ ID 
No: 14, SEQ ID No: 15, SEQ ID No: 16, SEQ ID No: 17 and SEQ ID No: 18 is (are) only 
present in particular subtypes of B-CLL associated with poor prognosis and completely 
absent in other subtypes of B-CLL and in healthy tissue (see in particular example 2). By 
completely absent is meant that the expression products are not detected in any of the 

15 other tissue types with the methods used in the appended examples. This is indicative of a 
complete absence of any transcript or a very low level of transcript in the other tissue 
types. 

The expression product is encoded by SEQ ID No 1 and/or SEQ ID No 5 r and the 
20 expression product is selected from the group consisting of transcriptional products and 
translational products. 

Thus, the present invention relates to a method for detecting the presence or absence of 
at least one expression product, wherein the at least one expression product comprise a 
25 nucleotide sequence selected from the group consisting of SEQ ID No:12, SEQ ID No: 13, 
SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18 in a 
biological sample isolated from an individual for establishing a differentia! diagnosis of B- 
CLL or for determining the prognosis of the B-CLL. 
w 

30 "Expression product" is herein meant to be a product which is the result of the expression 
of a polynucleotide such as DNA sequence, e.g. a genomic DNA sequence, and is in the 
form of either a polypeptid or in the form of a polynucleotide, i.e. an expression product 
can be selected from the group consisting of a transcriptional product and a translational 
product. In the case where the expression product is a polynucleotide, said polynucleotide 

35 is preferably mRNA selected from the group consisting of mRNA, pre-mRNA, pre-pro- 
mRNA . 

A "transcriptional product" or a "transcription product" is herein meant to be a product 
resulting from a transcription of a polynucleotide such as a DNA molecule, preferably a 
40 genomic DNA molecule. A transcriptional product is inherently a nucleotide, such as an 
oligonucleotide or a polynucleotide. 
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A "translational product" or a "translation product" is herein meant to be a product 
resulting from a translation of a transcriptional product such as a mRNA. A translational 
product is inherently a oligopeptide or a polypeptide. 

5 The expression product of the present invention has almost exclusively been found as 
transcription products in patients with poor B-CLL prognosis. Based on the experimental 
data presented in the herein, the inventors expect that it turns out that the subtype of B- 
CLL is characterised solely or better by the presence of a transcriptional or translational 
product which comprise a sequence selected from the group consisting of SEQ ID No: 3, 
10 SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 
and SEQ ID No:18. 

Preferably the individual is a mammal, more preferably a human. It is also expected that 
the gene encoded by SEQ ID No 1 and/or SEQ ID No 5 and the expression products 
15 derived from said gene can be used as a diagnostic tool in other species in particular in 
mammals selected from the group: domestic animals such as cow, horse, sheep, pig; and 
pets such as cat or dog. 

In the case that the expression product is a transcriptional product, this transcriptional 

20 product just needs to comprise at least one of the nucleotide sequences selected from the 
group consisting of SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID 
No: 16, SEQ ID No: 17 and SEQ ID No: 18. These transcriptional products will preferably be 
derived from SEQ ID No 1 and/or SEQ ID No 5 and may be in the form of mRNA or any 
pre- or pro-forms of said mRNA. As described, the transcriptional product may comprise at 

25 least one of the nucleotide sequences selected from the group consisting of SEQ ID No: 12, 
SEQ ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 and SEQ ID 
No: 18, such as one of the nucleotide sequences selected from the group consisting of SEQ 
ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, SEQ ID No: 16, SEQ ID No: 17 and 
SEQ ID No: 18, such as two of the nucleotide sequences selected from the group consisting 

30 of SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, SEQ ID No: 16, SEQ ID 
No: 17 and SEQ ID No: 18, such as 3 of the nucleotide sequences selected from the group 
consisting of SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, SEQ ID No: 16, 
SEQ ID No:17 and SEQ ID No:18, e.g. 4 of the nucleotide sequences selected from the 
group consisting of SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, SEQ ID 

35 No: 16, SEQ ID No: 17 and SEQ ID No: 18, such as 5 of the nucleotide sequences selected 
from the group consisting of SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, 
SEQ ID No: 16, SEQ ID No: 17 and SEQ ID No: 18 and ultimately the transcriptional product 
may comprise all of the six nucleotide sequences selected from the group consisting of 
SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 

40 and SEQ ID No: 18. The transcriptional product of the present invention can have any 
sequence which is a result of combining the nucleotide sequences selected from the group 
consisting of SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, SEQ ID No: 16, 
SEQ ID No: 17 and SEQ ID No: 18 as long as the specific nucleotide sequences selected 
from the group consisting of SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, 
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SEQ ID (Mo: 16, SEQ ID No: 17 and SEQ ID No: 18 can be identified as intact sequences in 
the transcriptional product. 

Examples of transcriptional products in the form of specific mRNAs which comprise at least 
5 one of the nucleotide sequences selected from the group consisting of SEQ ID No: 12, SEQ 
ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18 
are the sequences corresponding to SEQ ID No 2 (short cDNA clone) SEQ ID No 4 (long 
cDNA clone) SEQ ID No 6, SEQ ID No 7, SEQ ID No 8, SEQ ID No 9, SEQ ID No 10 and 
SEQ ID No 11. These mRNA sequences have been found in patients with poor prognosis. 

10 

It is obvious for a person skilled in the art that any fragments of SEQ ID No: 2, SEQ ID 
No:4, SEQ ID No:6, SEQ ID No:7 # SEQ ID No:8, SEQ ID No:9 f SEQ ID No: 10 and SEQ ID 
No: 11 will have the same diagnostic value as long as the nucleotide sequences selected 
from the group consisting of SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, 
15 SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18 can be found in the fragments. 

The mRNA sequence may be detected in a sample using hybridisation techniques. In 
particular when more than one analysis is to be performed at the same time it is 
advantageous to use a DNA array comprising e.g. an oligomer of at least 15 consecutive 
20 bases selected from the group consisting of SEQ ID No:2, SEQ ID No:4, SEQ ID No:6, SEQ 
ID No:7, SEQ ID No:8, SEQ ID No:9, SEQ ID No:10, SEQ ID No:ll, SEQ ID No:12, SEQ ID 
No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18. 

Another way of detecting the presence or absence of the transcriptional product is by 
25 specifically amplifying the transcriptionals product having a sequence corresponding to 
SEQ ID No 2, 4, 6, 7, 8, 9, 10 or 11 or fragments thereof. This can be done by selecting 
primer pairs which cause only the amplification of these sequences. 

Generally, hybridisation techniques are selected from not limited to the group consisting of 
30 in situ hybridisation, northern blots, Southern blots, dot blots and PCR based techniques. 

A non-limiting list of PCR based techniques include rt-PCR, quantitative PCR and realtime 
PCR. 

35 According to yet another embodiment, the translational product is a protein encoded by a 
polynucleotide selected from the group consisting of SEQ ID No:l, SEQ ID No: 5, SEQ ID 
No:2, SEQ ID No:4, SEQ ID No:6, SEQ ID No:7, SEQ ID No:8, SEQ ID No:9, SEQ ID No:10 
and SEQ ID No: 11. Detection of this protein can be done with state of the art methods 
including the detection with an antibody directed against said protein, such as Western 

40 blotting, more preferably by using a fluorescently labelled antibody, preferably wherein the 
method comprises the use of flowcytometry, such as FACS. Other methods include but are 
not limited to gel electrophoresis, gel filtration, ion exchange chromatography, FPLC, mass 
spectrometry and immunohlstochemistry. 
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Preferably, said protein is selected from the group comprising SEQ ID No 3 (protein), or a 
protein sharing at least 60 % sequence identity with SEQ ID No 3. The protein with the 
amino acid sequence set forth in SEQ ID No 3 is the longest open reading frame in the 
cDNA sequence of SEQ ID No 2 or 4. 

5 

In a specific embodiment of the present invention is a method for determining whether an 
individual has a B-CLL sub-type with poor prognosis, the method comprising determining 
the level of an expression product which comprise a nucleotide sequence selected from the 
group consisting of SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID 

10 No: 16, SEQ ID No: 17 and SEQ ID No: 18 of said individual, and indicating the individual as 
having a B-CLL sub-type with poor prognosis if the level of the expression product is at or 
beyond a discriminating value and indicating the individual as not having a B-CLL sub-type 
with poor prognosis if the level of the expression product is not at or beyond the discrimi- 
nating value, the discriminating value being a value which has been determined by 

15 measuring the level of the expression product which comprise a nucleotide sequence 

selected from the group consisting of SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID 
No: 15, SEQ ID No: 16, SEQ ID No: 17 and SEQ ID No: 18 in both a healthy control 
population and a population with known B-CLL sub-type with poor prognosis, thereby 
determining said discriminating value which identifies the B-CLL sub-type population 

20 having a poor prognosis. 

In this method the individual may be a member of an unselected population or be a 
member of a population already identified as having a B-CLL sub-type with a poor 
prognosis. 

25 

The above method may be performed such that the determination is performed at several 
time points at intervals as part of a monitoring of a cancer patient after or during the 
treatment for primary cancer. 

30 The methods described so-far relate to the determination of the presence or absence of an 
expression product which comprise a nucleotide sequence selected from the group 
consisting of SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, SEQ ID No: 16, 
SEQ ID No: 17 and SEQ ID No: 18. By quantitatively measuring the amount of an 
expression product which comprise a nucleotide sequence selected from the group 

35 consisting of SEQ ID No:12, SEQ ID No:13, SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, 
SEQ ID No: 17 and SEQ ID No: 18 in a biological sample isolated from an individual, it is 
possible to predict the progression of B-CLL in an individual. 

In one embodiment the quantitative measurement is performed during treatment to 
40 estimate the efficiency of such treatment. 

A preferred embodiment of the above diagnostic and prognostic methods is a method for 
detecting the presence or absence an expression product, wherein said at least one 
expression product comprise the nucleotide sequence of SEQ ID No: 15 in a biological 
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sample isolated from an individual for establishing a differential diagnosis of B-CLL or for 
determining the prognosis of the B-CLL. 

A further preferred embodiment of the above diagnostic and prognostic methods is a 
5 method for detecting the presence or absence an expression product, wherein said at least 
one expression product comprise the nucleotide sequence of SEQ ID No: 16 in a biological 
sample isolated from an individual for establishing a differential diagnosis of B-CLL or for 
determining the prognosis of the B-CLL. 

10 Yet a further preferred embodiment of the above diagnostic and prognostic methods is a 
method for detecting the presence or absence an expression product, wherein said at least 
one expression product comprise the nucleotide sequence spanning the junction sequence 
between Exon-2 (SEQ ID No: 15) and Exon-3 (SEQ ID No: 16) in a biological sample 
isolated from an individual for establishing a differential diagnosis of B-CLL or for 

15 determining the prognosis of the B-CLL. 

The nucleotide sequence spanning the junction between Exon-2 and Exon-3 is the last 20 
nucleotides of the 3'-end of SEQ ID No: 15 and the first 20 nucleotides of the 5'-end of SEQ 
ID No:16. 

20 

The "junction sequence" between two nucleotide sequences, such as two exons, is herein 
defined as the at least 20 3'-nucleotides of the first exon which is located 5' relative to the 
second exon and the at least 20 S'-nucleotides of the second exon which is located 3' 
relative to the first exon. 

25 

For all diagnostic applications of the present invention, the biological sample may be 
selected from the group comprising blood, serum, plasma, urine, saliva, lymph node 
biopsy, bone marrow, spinal liquid, spleen biopsy, and liver biopsy. The cells to be 
assessed in a sample are preferably leukocytes, mononuclear leukocytes or lymphocytes or 
30 B-lymphocytes. 

A further embodiment of the present invention also includes a diagnostic kit for ex vivo or 
In situ diagnosis of a subtype of B-cell chronic lymphocytic leukaemia (B-CLL) in a 
individual, the kit comprising a detector molecule capable of detecting the presence or 
35 absence of at least one expression product, wherein said at least one expression product 
comprise a nucleotide sequence selected from the group consisting of SEQ ID SEQ ID 
No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, SEQ ID No: 16, SEQ ID No: 17 and 
SEQ ID No: 18 in a biological sample isolated from the individual. 

40 The detector molecule is preferably a nucleotide and even more preferably a nucleotide 
capable of hybridising to a nucleotide sequence selected from the group consisting of SEQ 
ID SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, SEQ ID No: 16, SEQ ID 
No: 17 and SEQ ID No: 18 under stringent condition. 
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B-CLL therapy 

With the identification of a new sub-type of B-CLL having a poor prognosis, the present 
inventors also provide methods for treatment of B-CLL in such patients. This method is 
based on the finding that transcription products comprising these sequence products are 
5 present in B-CLL ceils of patients with the poor prognosis. By modifying the activity and/or 
level of these transcription products, a treatment and/or cure for B-CLL is provided. 

Accordingly, in a therapeutic aspect of the present invention there is provided a method of 
treating a B-CLL sub-type with poor prognosis comprising administering to an individual 
10 with a poor prognosis B-CLL diagnosis a compound capable of decreasing or inhibiting the 
formation of an expression product which comprise a nucleotide sequence selected from 
the group consisting of SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, SEQ 
ID No: 16, SEQ ID No: 17 and SEQ ID No: 18. 

15 One such method is based on administering an oligonucleotide capable of inhibiting 

transcription from SEQ ID No 1 and/or SEQ ID No 5. Said oligonucleotide may comprises 
at least 8-10 consecutive nucleotides from the sequence of SEQ ID No 1. These sequences 
constitute the putative promoter sequences controlling the transcription transcription 
products which comprise a nucleotide sequence selected from the group consisting of SEQ 

20 ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, SEQ ID No: 16, SEQ ID No: 17 and 
SEQ ID No: 18. The oligonucleotides bind specifically to the promoter sequences and inhibit 
transcription of the gene. Such oligonucleotides may comprises nucleotide monomers 
selected from the group: DNA, RNA, LNA, PNA, methylated DNA, methylated RNA, more 
preferably PNA or LNA. 

25 

In a more preferred embodiment the therapeutic methods comprise administering an 
oligonucleotide capable of binding to a transcriptional product and preventing translation 
by destroying the transcriptional product. One particularly preferred embodiment of this 
aspect is RNA interference (RNAi) oligonucleotides. 

30 

The discovery of the phenomenon RNAi has revealed an entirely new level of gene 
regulation in eukaryotic cells. It is based on the observation that the presence of long 
double stranded RNA (dsRNA) in a cell almost completely eliminates the expression of the 
gene having the same sequence, whereas expression of other unrelated genes are left 

35 undisturbed. Although this observation had been know for time in plants as 

posttranscriptional gene silencing (PTGS) it was not before it was characterised as a 
general mechanism throughout the animal kingdom that its potentials were fully 
appreciated. Over the last few years it has been developed as a robust technique to knock 
down any desirable gene in worms and flies, and quickly a large body of information was 

40 gathered about the function of genes in these organisms. Due to the activation of the 
interferon system by long dsRNA the RNAi method was at that time not applicable in a 
mammalian system. 
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A key observation that allowed the harnessing of RNAi as a tool for regulating gene 
expression in mammals was the observation that chemically synthesised oligo-mer small 
interfering RNAs (siRNA) effectively suppress gene expression in several human cell lines 
without inflecting interferon response. This has triggered new promises for siRNA as a 
5 therapeutic drug in humans. 

RNAi works by hybridising specifically to the mRNA transcribed by the cell to form a 
(partly) double stranded RNA molecule. This is recognised as a double stranded molecule 
by the cell's own nucleases, which degrade them. 

10 

In order for the technique to work efficiently, the siRNA oligonucleotide comprises a 
sequence of 5-30 consecutive nucleotides which is the complementary sequence of the 
nucleotide sequences selected from the group consisting of SEQ ID No: 12, SEQ ID No: 13, 
SEQ ID No: 14, SEQ ID No: 15, SEQ ID No: 16, SEQ ID No: 17 and SEQ ID No: 18. By 

15 targeting at least one of the nucleotide sequences selected from the group consisting of 
SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, SEQ ID No: 16, SEQ ID No: 17 
and SEQ ID No: 18, the transcriptional products characterised by the nucleotide sequences 
from the group of SEQ ID No:2, SEQ ID No:4, SEQ ID No:6, SEQ ID No:7, SEQ ID No:8, 
SEQ ID No:9, SEQ ID No: 10 and SEQ ID No: 11 will be eliminated. Example 5 shows that 

20 cells characteristic for the poor prognosis B-CLL sub-type can be eliminated by destroying 
the herein mentioned transcription products. 

By "complementarity" is meant a nucleic acid that can form hydrogen bond(s) with other 
DNA or RNA sequence by either traditional Watson-Crick or other non-traditional types of 
25 base-paired interactions, e.g. Hoogsteen type. 

Preferred siRNA molecules of the present invention are between 5 to 30 nucleotides long, 
such as 8-30 nucleotides long, such as 8-25 nucleotides, e.g. 8-24 nucleotides, e.g. 8-23 
nucleotides, e.g. 8-22 nucleotides, e.g. 8-21 nucleotides, such as 8-20 nucleotides, e.g. 9- 

30 23 nucleotides, e.g. 10-23 nucleotides, such as 11-23 nucleotides, e.g. 12-23 nucleotides 
such as 13-23 nucleotides, e.g. 14-23 nucleotides, e.g. 15-23 nucleotides, such as 16-23 
nucleotides, such as 8 nucleotides, 9 nucleotides, 10 nucleotides, 11 nucleotides, 12 
nucleotides, 13 nucleotides, 14 nucleotides, 15 nucleotides, 16 nucleotides, 17 nucleotides, 
18 nucleotides, 19 nucleotides, 20 nucleotides, 21 nucleotides, 22 nucleotides, 23 

35 nucleotides, 24 nucleotides, 25 nucleotides, 26 nucleotides, 27 nucleotides, 28 nucleotides, 
29 nucleotides, such as 30 nucleotides long. 

RNAi oligonucleotides may be administered to the cell, or a vector may be transfected into 
the cells, said vector comprising a promoter region capable of directing the expression of 
40 at least one RNAi oligonucleotide. Due to the very restricted expression of the AMB-1 gene, 
it is not important only to target the RNAi oligos or the vectors to B-CLL cells. 

One way of targeting to blood cells comprises using a heparin receptor for targeting to 
blood cells. 
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Another way of targeting the transcriptional products which comprise at least one 
nucleotide sequence selected from the group consisting of SEQ ID No: 12, SEQ ID No: 13, 
SEQ ID No:14, SEQ ID No:15, SEQ ID No:16, SEQ ID No:17 and SEQ ID No:18 is to use 
5 an antisense construct comprising a promoter sequence capable of directing the 

transcription of at least part of the antisense equivalent of SEQ ID No 1 or 2 or 4 or 5 or 6 
or 7 or 8 or 9 or 10 or 11 specifically to the poor prognosis B-CLL sub-type. 

When desired targeting to B-CLL cells can be performed using the CD 19 or CD20 receptor. 
10 The CD19 receptor is particularly preferred since it internalises its ligand. 

In a further therapeutic embodiment the compound is a gene therapy vector comprising a 
promoter sequence operably linked to a sequence coding for a protein capable of inhibiting 
cell division in the cell and/or capable of killing the cell, said promoter sequence being a 
15 tissue specific promoter capable of directing expression only in B cells, more preferably 
only in B-CLL cells. One particularly preferred promoter sequence is the extremely ceil 
specific promoter of SEQ ID No:l or SEQ ID No:5. When this promoter is used targeting of 
the suicide vector is not very important, since it will only be active in the cells in which 
AMB-1 is expressed and these are the cells to be targeted by the suicide gene. 

20 

Deletion studies will determine the exact length of the promoter sequence counted from 
the transcription start site. Accordingly, the promoter may comprise at least 100 
nucleotides of Seq_ID:l or Seq_ID:5, such as at least 200 nucleotides, for example at 
least 300 nucleotides, such as at least 400 nucleotides, for example at least 500 

25 nucleotides, such as at least 600 nucleotides, for example at least 700 nucleotides, such as 
at least 800 nucleotides, for example at least 900 nucleotides, such as at least 1000 
nucleotides, for example at least 1100 nucleotides, such as at least 1200 nucleotides, for 
example at least 1300 nucleotides, such as at least 1400 nucleotides, for example at least 
1500 nucleotides, such as at least 1600 nucleotides, for example at least 1700 nucleotides, 

30 such as at least 1800 nucleotides, for example at least 1900 nucleotides, such as at least 
2000 nucleotides, for example at least 2500 nucleotides, such as at least 3000 nucleotides, 
for example at least 3500 nucleotides, such as at least 5000 nucleotides, for example at 
least 10,000 nucleotides. 

35 The specificity of expression of mRNAs described by the present invention are striking. The 
RT-PCR data and the Northern blot data using the dot-blot disclose that the mRNAs of the 
present invention are expressed either at very low levels in other tissues or only in the B- 
CLL patients where one can detect it. 

40 Thus one embodiment of the present invention is the use of the promotor region for use in 
gene therapy. The promotor is defined as any sequence within SEQ ID No:l and SEQ ID 
No:5 that directs the formation of an expression product which comprise a nucleotide 
sequence selected from the group consisting of SEQ ID No: 12, SEQ ID No: 13, SEQ ID 
No: 14, SEQ ID No: 15, SEQ ID No: 16, SEQ ID No: 17 and SEQ ID No: 18, said expression 
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product being any of the sequences selected from the group consisting of SEQ ID No:2, 
SEQ ID No:4, SEQ ID No:6, SEQ ID No:7, SEQ ID No:8, SEQ ID No:9, SEQ ID No: 10 and 
SEQ ID No: 11 in B-CLL cells or any other cell or tissue types in which any of the sequences 
are transcribed. 

5 

One way that one can use the promotor region in gene therapy is to make a gene therapy 
construct where the promotor drives the expression of a cell suicide gene such as but not 
limited to the gene for HSV-1 thymidine kinase, the varicella-zoster, virus thymidine 
kinase gene, E.Coli cytosine deaminase, the nitroreductase gene or the E.Coli Deo gene 
10 (Yazawa et al. & Kirn D. et al.). This would allow for selective expression of the suicide 
genes in B-CLL cells. 

Alternatively, the promotor could be used for a selective expression of genes that could 
have curative effects when expressed in B-CLL cells, but unwanted effects if expressed 
15 ubiquitously. 

Also, one embodiment of the present invention relates to the use of the promotor region 
for use in screening assays where the promotor is linked to a reporter gene and 
transfected into B-CLL cells in which the reporter gene will be expressed. This approach 
20 would allow for easy screening for compounds that would turn off the expression of the 
reporter gene for example by killing the cell. 

A presently preferred embodiment relates a gene therapy vector of the present invention 
comprising an oligonucleotide capable of inhibiting transcription from SEQ ID No 1 and/or 
25 SEQ ID No 5, wherein the promoter is a B-CLL specific promoter, which may or may not be 
operably linked to a protein selected from the group comprising HSV-1 thymidine kinase, 
the varicella-zoster, virus thymidine kinase gene, E.Coli cytosine deaminase, the 
nitroreductase gene or the E.Coli Deo gene. 

30 In one embodiment the compound is a therapeutic antibody directed against a polypeptide 
having the amino acid sequence of SEQ ID No 3, preferably wherein said antibody is a 
human or humanised antibody. Another possibility is to identify a modulator of binding of 
SEQ ID No 3 to its receptor within or outside the cell and to administer this modulator to 
the cells. 

35 

4-heIical cytokines 

A further object of preferred embodiments of the present invention is an isolated 
polypeptide comprising an amino acid sequence selected from the group consisting of: 

40 i) an amino acid sequence of SEQ ID NO: 3, 

ii) an amino acid sequence having at least 60% sequence identity compared to 
the full length sequence of SEQ ID NO: 3 
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ii) a fragment of SEQ ID NO:3 having at least 60% sequence identity compared 
to the full length sequence of SEQ ID NO: 3. 

The protein encoded by SEQ ID No 1 and/or SEQ ID No 5 shares a very small sequence 
5 identity with any known protein. However, it has been possible to use 2D and 3D analytical 
tools to identify the protein as a 4-helical cytokine. The 3D structure of the protein is very 
similar to 4-helical cytokines and in particular to IL4. 

IL4 is a very important cytokine in B-CLL biology. IL4 is not expressed by B-CLL cells, but 
10 the IL4 receptor is found on the cells. The IL4 that stimulates B-CLL cells is believed to be 
produced by T-lymphocytes. The role of IL4 in B-CLL biology is complicated. It has been 
suggested that IL4 can inhibit B-CLL DNA synthesis and proliferation. Other reports 
demonstrated that IL4 protects B-CLL cells from apoptosis by upregulating Bcl-2, and IL4 
was shown to inhibit apoptosis without stimulating proliferation. Recently, a clinical study 
15 in Sweden has confirmed these in vitro studies since IL4 administration to B-CLL patients 
resulted in increased numbers of B-CLL cells in the blood, suggesting that IL4 had a 
stimulatory or anti-apoptotic effect on the B-CLL cells in vivo (Lundin, et al.). 

In many systems the effects of IL13 are largely similar to those of IL4, but IL13 is slightly 
20 less potent that IL4. It is unclear whether B-CLL cells express IL13, but the cells do 

express the IL13 receptor. The effects of IL13 in B-CLL are controversial. While Chaouchi 
et al. suggested that IL13, like IL4 protects B-CLL cells from apoptosis (Chaouchi et al), 
studies by Fluckiger et al. suggest that this is not the case (Fluckiger et al.). 

25 The combined finding of 2D and 3D structure similarity to 4-helical cytokines and the 
importance of IL4 in B-CLL strongly suggests that the novel class of proteins of which the 
AMB-1 protein is one representative are cytokines. 

These polypeptides constitute a novel class of proteins sharing 2D and 3D structure 
30 similarities with 4-helical cytokines. In a preferred embodiment, the isolated polypeptide 
comprises or essentially consists of the amino acid sequence of SEQ ID No. 3 or a 
fragment thereof. This particular protein at least can be used for diagnosis, for raising 
antibodies for use in therapy against B-CLL, and for protective or therapeutic immunisation 
of an individual against B-CLL. 

35 

Consequently, the isolated polypeptide preferably has interleukin activity or interleukin like 
activity, such as having IL3, IL13, GM-CSF, TGF-0, IGF activity, more preferably having 
IL4 activity or IL4 like activity. 

40 Probably the isolated polypeptides are capable of forming homo- or hetero-oligomer with 
each other and among themselves. Such oligomers are also within the scope of the present 
invention. Such oligomers may comprise at least one isolated polypeptides as defined in 
any the present invention, such as a dimer, a trimer, a quatramer, a quintamer, a 
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hexamer, an octamer, a decamer, a dodecamer. In biological systems the activity may be 
attributed only to dimer or higher -mer. 

The protein defined by SEQ ID No 3 shares very little sequence identity with known 
5 cytokines and interleukines and as a matter of fact very little sequence identity with any 
known protein. Consequently the present inventors contemplates that the group comprises 
functionally equivalent polypeptide sharing at least 60% sequence identity with SEQ ID No 
3, more preferably at least 70% sequence identity, more preferably at least 80 % 
sequence identity, such as at least 90 % sequence Identity, for example at least 95 % 
10 sequence identity, such as at least 97 % sequence identity, for example at least 98 % 
sequence identity. 

Activity as a cytokine or interleukin can be assessed in a biological assay where the 
polypeptide is contacted with a cytokine dependent cell line. Accordingly, polypeptides 
15 with cytokine or interleukin like activity can also be identified by similar methods. 

One approach to assess cytokine/interleukin activity in a biological assay is to express the 
CDS (SEQ ID No: 17) reading frame in a baculovirus system (Invitrogen, Carlsbad, USA) 
and purify the protein . The recombinant protein can be assayed in cytokine induced 
20 proliferation assays as described in general in the eBioscience catalog & Reference Manual 
2002 p.260-262 (eBioscience, San Diego, USA). In particular IL4 activity can be 
determined using the CTh4S cell line as described by Petersen et al (see Example 8). 

The promoter sequence (which forms part of SEQ ID No 1 and/or SEQ ID No 5) and the 
25 coding sequences (SEQ ID No:3) can be used in various aspects of gene therapy and 
immunotherapy. 

Further polynucleotide sequences from other individuals or other species with the same 
function can be isolated by one of the following methods, which each form independent 
30 aspects of the present invention. 

A first method for identifying a nucleotide sequence encoding a 4-helical cytokine 
comprises the steps of: 

i) isolating mRNA from a biological sample, 

35 ii) hybridising the mRNA to a probe comprising at least 10 nucleotides of the coding 
sequence of SEQ ID No 1 and/or SEQ ID No 5 under stringent conditions, 
Hi) determining the nucleotide sequence of a sequence capable of hybridising under step 

ii) , and 

iv) determining the presence of an open reading frame in the nucleotide sequence 
40 determined under step iii). 

A second method for identifying a nucleotide sequence encoding a 4-helical cytokine is a 
computer assisted method comprising the steps of 
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i) performing a sequence similarity search of at least 10 nucleotides of the coding 
sequence SEQ ID No 1 and/or SEQ ID No 5, 

ii) aligning "hits" to said coding sequence, 

iii) determining the presence of an open reading frame in the "hits". 

It is highly likely that other similar polypeptides encoding further 4-heiical cytokines can be 
found in other individuals and/or other species of mammals. In particular, individuals of 
other geographical origin may carry genes which differ from the polynucleotides of the 
present invention. It is also conceivable that similar sequences can be found in closely and 
even in distantly related species. 

Functional Equivalents 

Modification and changes may be made in the structure of the peptides of the present 
invention and DNA segments which encode them and still obtain a functional molecule that 
encodes a protein or peptide with desirable characteristics. The following is a discussion 
based upon changing the amino acids of a protein to create an equivalent, or even an 
improved, second-generation molecule. The amino acid changes may be achieved by 
changing the codons of the DNA sequence, according to the genetic code. 

For example, certain amino acids may be substituted for other amino acids in a protein 
structure without appreciable loss of interactive binding capacity with structures such as, 
for example, antigen-binding regions of antibodies, binding sites of receptors, or binding 
sites on substrate molecules. Since it is the interactive capacity and nature of a protein 
that defines that protein's biological functional activity, certain amino acid sequence 
substitutions can be made in a protein sequence, and, of course, its underlying DNA coding 
sequence, and nevertheless obtain a protein with like properties. It is thus contemplated 
by the inventors that various changes may be made in the peptide sequences of the 
disclosed compositions, or corresponding DNA sequences which encode said peptides 
without appreciable loss of their biological utility or activity. 

In making such changes, the hydropathic index of amino acids may be considered. The 
importance of the hydropathic amino acid index in conferring interactive biologic function 
on a protein is generally understood in the art (Kyte and Doolittle, 1982, incorporate 
herein by reference). It is accepted that the relative hydropathic character of the amino 
acid contributes to the secondary structure of the resultant protein, which in turn defines 
the interaction of the protein with other molecules, for example, enzymes, substrates, 
receptors, DNA, antibodies, antigens, and the like. Each amino acid has been assigned a 
hydropathic index on the basis of their hydrophobicity and charge characteristics (Kyte and 
Doolittle, 1982), these are: isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine 
(+2.8); cysteine/cystine (+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); 
threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine 
(-3.2); glutamate (-3.5); glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (- 
3.9); and arginine (-4.5). 
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It is known in the art that certain amino acids may be substituted by other amino acids 
having a similar hydropathic index or score and still result in a protein with similar . 
biological activity, ie. still obtain a biological functionally equivalent protein. In making 
such changes, the substitution of amino acids whose hydropathic indices are within ±2 is 
5 preferred, those which are within ±1 are particularly preferred, and those within ±0.5 are 
even more particularly preferred. It is also understood in the art that the substitution of 
like amino acids can be made effectively on the basis of hydrophilicity. U.S. Pat. No. 
4,554,101, incorporated herein by reference, states that the greatest local average 
hydrophilicity of a protein, as governed by the hydrophilicity of its adjacent amino acids, 
10 correlates with a biological property of the protein. 

As detailed in U.S. Pat. No. 4,554,101, the following hydrophilicity values have been 
assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0±1); 
glutamate (+3.0±1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); 

15 threonine (-0.4); proline (-0.5±1); alanine (-0.5); histidine (-0.5); cysteine (-1.0); 
methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (-2.3); 
phenylalanine (-2.5); tryptophan (-3.4). It is understood that an amino acid can be 
substituted for another having a similar hydrophilicity value and still obtain a biologically 
equivalent, and in particular, an immunologically equivalent protein. In such changes, the 

20 substitution of amino acids whose hydrophilicity values are within ±2 is preferred, those 
which are within ±1 are particularly preferred, and those within ±0.5 are even more 
particularly preferred. 

As outlined above, amino acid substitutions are generally therefore based on the relative 
25 similarity of the amino acid side-chain substituents, for example, their hydrophobicity, 

hydrophilicity, charge, size, and the like. Exemplary substitutions which take various of the 
foregoing characteristics into consideration are well known to those of skill in the art and 
include: arginine and lysine; glutamate and aspartate; serine and threonine; glutamine 
and asparagine; and valine, leucine and isoleucine. 

30 

Functional equivalents and variants are used interchangably herein. In one preferred 
embodiment of the invention there is also provided variants of a 4-helical cytokine, and 
variants of fragments thereof. When being polypeptides, variants are determined on the 
basis of their degree of identity or their homology with a predetermined amino acid 
35 sequence, said predetermined amino acid sequence being SEQ ID No. 3 or a fragment 
thereof. 

Accordingly, variants preferably have at least 60 % sequence identity, for example at least 
65% sequence identity, such as at least 70 % sequence identity, for example at least 75% 
40 sequence identity, for example at least 80% sequence identity, such as at least 85 % 
sequence identity, for example at least 90 % sequence identity, such as at least 91 % 
sequence identity, for example at least 91% sequence identity, such as at least 92 % 
sequence identity, for example at least 93 % sequence identity, such as at least 94 % 
sequence identity, for example at least 95 % sequence identity, such as at least 96 % 
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sequence Identity, for example at least 97% sequence identity, such as at least 98 % 
sequence identity, for example 99% sequence identity with the predetermined sequence. 

To determine the percent identity of two amino acid sequences or of two nucleic acids, the 
5 sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in 
the sequence of a first amino acid or nucleic acid sequence for optimal alignment with a 
second amino or nucleic acid sequence). The amino acid residues or nucleotides at 
corresponding amino acid positions or nucleotide positions are then compared. When a 
position in the first sequence is occupied by the same amino acid residue or nucleotide as 

10 the corresponding position in the second sequence, then the molecules are identical at that 
position. The percent identity between the two sequences is a function of the number of 
identical positions shared by the sequences (i.e., % identity # of identical positions/total # 
of positions (e.g., overlapping positions) x 100). In one embodiment the two sequences 
are the same length. The sequence identity is preferably calculated relative to the full 

15 length sequence of the molecule of the present invention. 

A degree of "sequence identity" of amino acid sequences Is a function of the number of 
identical amino acids at positions shared by the amino acid sequences. A degree of 
"sequence homology" or "sequence similarity" of amino acid sequences is a function of the 
20 number of amino acids, i.e. structurally related, at positions shared by the amino acid 
sequences. Sequence identity is determined by the alignment algorithm that performs 
global alignments which has been described by Smith TF and Waterman MS (Smith TF et 
al.) 

25 A list of the standard qualifiers and the default values for the alignment algorithm is given 
below: 



Standard (Mandatory) qualifiers 


Allowed values 


Default 


[-asequence] 
(Parameter 1) 


Sequence USA 


Readable sequence 


Required 


[-bsequence] 
(Parameter 2) 


Sequence database USA 


Readable 
sequence(s) 


Required 


-gapopen 


The gap open penalty is the score taken 
away when a gap is created. The best value 
depends on the choice of comparison 
matrix. The default value assumes you are 
using the EBLOSUM62 matrix for protein 
sequences, and the EDNAFULL matrix for 
nucleotide sequences. 


Number from 1.000 
to 100.000 


10.0 for any 
sequence 
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-gapextend 


The gap extension penalty is added to the 
standard gap penalty for each base or 
residue in the gap. This is how long gaps are 
penalized. Usually you will expect a few long 
gaps rather than many short gaps, so the 
gap extension penalty should be lower than 
the gap penalty. An exception is where one 
or both sequences are single reads with 
possible sequencing errors in which case 
you would expect many single base gaps. 
You can get this result by setting the gap 
open penalty to zero (or very low) and using 
the gap extension penalty to control gap 
scoring. 


Number from 0.100 
to 10.000 


0.5 for any 
sequence 


[-outfile] 
(Parameter 3) 


Output alignment file name 


Alignment output 
file 




Additional (Optional) qualifiers 


Allowed values 


Default 


-datafile 


This is the scoring matrix file used when 
comparing sequences. By default it is the 
file 'EBLOSUM62' (for proteins) or the file 
'EDNAFULU (for nucleic sequences). These 
files are found in the 'data' directory of the 
EMBOSS installation. 


Comparison matrix 
file in EMBOSS data 
path 


EBLOSUM62 for 
protein EDNAFULL 
ror una 


Advanced (Unprompted) qualifiers 


Allowed values 


Default 


-[no]brief 


Brief identity and similarity 


Boolean value 
Yes/No 


Yes 



An "unrelated" or "non-homologous" sequence shares less than 40% identity, though 
preferably less than 25% identity, with one of the 4-helical cytokine sequences of the 
5 present invention. The term "substantial identity" means that two peptide sequences, 
when optimally aligned, such as by the Smith and Waterman algorithm using default gap 
weights, share at least 80 percent sequence identity, preferably at least 90 percent 
sequence identity, more preferably at least 95 percent sequence identity or more (e.g., 99 
percent sequence identity). Preferably, residue positions which are not identical differ by 
10 conservative amino acid substitutions. 

Additionally, variants are also determined based on a predetermined number of 
conservative amino acid substitutions as defined herein below. Conservative amino acid 
substitution as used herein relates to the substitution of one amino acid (within a 
15 predetermined group of amino acids) for another amino acid (within the same group), 
wherein the amino acids exhibit similar or substantially similar characteristics. 

Within the meaning of the term "'conservative amino acid substitution" as applied herein, 
one amino acid may be substituted for another within the groups of amino acids indicated 
20 herein below: 
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• Amino acids having polar side chains (Asp, Glu, Lys, Arg, His, Asn, Gin, Ser, Thr, Tyr, 
and Cys,) 

• Amino acids having non-polar side chains (Gly, Ala, Val, Leu, He, Phe, Trp, Pro, and 
Met) 

5 • Amino acids having aliphatic side chains (Gly, Ala Val, Leu, He) 

• Amino acids having cyclic side chains (Phe, Tyr, Trp, His, Pro) 

• Amino acids having aromatic side chains (Phe, Tyr, Trp) 

• Amino acids having acidic side chains (Asp, Glu) 

• Amino acids having basic side chains (Lys, Arg, His) 
10 • Amino acids having amide side chains (Asn, Gin) 

• Amino acids having hydroxy side chains (Ser, Thr) 

• Amino acids having sulphor-containing side chains (Cys, Met), 

• Neutral, weakly hydrophobic amino acids (Pro, Ala, Gly, Ser, Thr) 

• Hydrophilic, acidic amino acids (Gin, Asn, Giu, Asp), and 
15 • Hydrophobic amino acids (Leu, He, Val) 

Preferred conservative amino acids substitution groups are: valine-leucine-isoieucine, 
phenylalanine-tyrosine, lysine-arginine, alanine-valine, and asparagine-glutamine. 

20 Accordingly, a variant or a fragment thereof according to the invention may comprise, 
within the same variant of the sequence or fragments thereof, or among different variants 
of the sequence or fragments thereof, at least one substitution, such as a plurality of 
substitutions introduced independently of one another. 

25 It is clear from the above outline that the same variant or fragment thereof may comprise 
more than one conservative amino acid substitution from more than one group of 
conservative amino acids as defined herein above. 

The addition or deletion of at least one amino acid may be an addition or deletion of from 
30 preferably 2 to 250 amino acids, such as from 10 to 20 amino acids, for example from 20 
to 30 amino acids, such as from 40 to 50 amino acids. However, additions or deletions of 
more than 50 amino acids, such as additions from 50 to 100 amino acids, addition of 100 
to 150 amino acids, addition of 150-250 amino acids, are also comprised within the 
present invention. The deletion and/or the addition may - independently of one another - 
35 be a deletion and/or an addition within a sequence and/or at the end of a sequence. 

The polypeptide fragments according to the present invention, including any functional 
equivalents thereof, may in one embodiment comprise less than 250 amino acid residues, 
such as less than 240 amino acid residues, for example less than 225 amino acid residues, 
40 such as less than 200 amino acid residues, for example less than 180 amino acid residues, 
such as less than 160 amino acid residues, for example less than 150 amino acid residues, 
such as less than 140 amino acid residues, for example less than 130 amino acid residues, 
such as less than 120 amino acid residues, for example less than 110 amino acid residues, 
such as less than 100 amino acid residues, for example less than 90 amino acid residues, 
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such as less than 85 amino acid residues, for example less than 80 amino acid residues, 
such as less than 75 amino acid residues, for example less than 70 amino acid residues, 
such as less than 65 amino acid residues, for example less than 60 amino acid residues, 
such as less than 55 amino acid residues, for example less than 50 amino acid residues. 

5 

"Functional equivalency" as used in the present invention is according to one preferred 
embodiment established by means of reference to the corresponding functionality of a 
predetermined fragment of the sequence. 

10 Functional equivalents or variants of a 4-helical cytokine will be understood to exhibit 
amino acid sequences gradually differing from the preferred predetermined 4-helical 
cytokine, as the number and scope of insertions, deletions and substitutions including 
conservative substitutions increases. This difference is measured as a reduction in 
homology between the preferred predetermined sequence and the fragment or functional 

15 equivalent. 

All fragments or functional equivalents of SEQ ID No. 3 are included within the scope of 
this invention, regardless of the degree of homology that they show to the respective, 
predetermined 4-helical cytokines disclosed herein. The reason for this is that some 
20 regions of the 4-helical cytokines are most likely readily mutatable, or capable of being 
completely deleted, without any significant effect on the binding activity of the resulting 
fragment. 

A functional variant obtained by substitution may well exhibit some form or degree of 
25 native cytokine activity, and yet be less homologous, if residues containing functionally 
similar amino acid side chains are substituted. Functionally similar in this respect refers to 
dominant characteristics of the side chains such as hydrophobic, basic, neutral or acidic, or 
the presence or absence of steric bulk. Accordingly, in one embodiment of the invention, 
the degree of identity is not a principal measure of a fragment being a variant or functional 
30 equivalent of a preferred predetermined fragment according to the present invention. 

One particularly preferred method of determining the degree of functional equivalence is 
by performing a biological or chemical assay such as the assays described in the appended 
examples. Preferred functional equivalents of SEQ ID No 3 are those that have a K D with 
35 respect to a predefined receptor which is less than 10 times higher than the K D of the 

polypeptide of SEQ ID No 1 with respect to the same receptor, more preferably less than 5 
times higher, more preferably less than 2 times higher. 

With respect to functional equivalence this may be defined in a biological assay based on a 
40 cytokine dependent or stimulated cell line. Such cell lines are e.g. available from American 
Type Culture Collection, P.O.Box 1549, Manassas, VA 20108 USA. The following cell lines 
at least are available for testing cytokines and in particular interleukins: 
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Accession number 

CRL-1841 

CRL-2003 

CRL-2407 

CRL-2408 

CRL-2409 

CRL-9589 

CRL-9591 

TIB-214 



10 



Description 

TH-2 clone ABE 

TF-1 

NK-92 

NK-92MI 

NK92CI 

AML-193 

MV-4-11 

CTLL-2 



Activity 

IL2 dependent, IL4 stimulated 

IL3 dependent 

IL2 dependent 

IL2 dependent 

IL2 dependent 

IL3 stimulated, GM-CSF dep. 

GM-CSF dependent 

IL2 dependent 



The following cell lines are available from DSMZ - Deutsche Sammlung von 
Mikroorganismen und Zellkulturen GmbH, Mascheroder Weg lb, D-38124 Braunschweig, 
GERMANY. As can be seen from the table, some of the cell lines can be used to broadly 
assess cytokine activity whereas others are only reported to respond to one or a few 
15 specific cytokines. 



Accession 
number 


Description 


Acvitity 


ACC 211 


Mouse hybridoma, B9 


IL6 dependent 


ACC 137 


Human acute myeloid leukemia, UT-7 


Constitutively cytokine 
responsive to various 
cytokines. 


ACC 104 


Human acute megakaryoblastic leukemia 


Respond with proliferation 
to: GM-CSF, IFN-alpha, 
IFN-a, IFN-gamma, IL2, 
IL3, IL4, IL6, IL15, NG F, 
SCF, TNF-a!pha, TPO 


ACC 247 


Human acute myeloid leukemia, OCI- 
AML5 


G-GSF, GM-CSF, IL3, FTL3- 
ligand 


ACC 271 


Human acute myeloid leukemia, MUTZ-2 


IL3, SCF, G-CSF, M-CSF, 
IFN-gamma 


ACC 334 


Human erythroleukemia, TF-1 


GM-CSF, IFN-gamma, IL3, 
IL4, IL5, IL6, IL13, UF, 
NGF, OSM, SCF, TNF- 
alpha, and TPO 



The TF-1 cell line mentioned above can be used for assaying IL13 function. This cell line is 
sensitive to various different cytokines but gives a very strong proliferative response when 
20 exposed to IL13. This cell line can in particular be used if there is no response in the IL4 
sensitive cell line (CT.h4S). Further cell lines which can be used for distinguishing between 
IL4 and IL13 activity include cell lines/hybridomas such as B-9-1-3 (Bouteiller et al.) and 
A201.1 (Andrews et al.). 
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Pharmaceutical uses of isolated polypeptides 

Apart from being used for diagnosis, it is also within the scope of the present invention to 
use an isolated polypeptide as defined in the invention for a pharmaceutical composition 
together with a pharmaceutical^ acceptable carrier. Such pharmaceutical compositions 
5 may be used for any of the purposes for which cytokines and in particular interleukin is 
used at present. 

Examples of such uses include the treatment of bone disorders, inflammation, for lowering 
blood serum cholesterol, allergy, infection, viral infections, hematopoietic disorders, 
10 preneoplastic lesions, immune related diseases, autoimmune related diseases, infectious 
diseases, tuberculosis, cancer, viral diseases, septic shock, reconstitution of the 
haematopoietic system, induction of the granulocyte system, pain, cardial dysfunction, 
CNS disorders, depression, artheritis, psoriasis, dermatitis, collitis, Crohn's disease, 
diabetes, in an individual in need thereof. 

15 

It is also within the scope of the present invention to use an isolated polypeptide according 
to the invention as an adjuvant or as an immune anhancer, for regulating TH2 immune 
responses, and for suppressing Thl immune responses. 

20 A further use of an isolated polypeptide of the invention is as a growth factor for 

administration to cell cultures or as a growth factor for veterinary use, e.g. for stimulating 
the growth of livestock. 

Immunotherapy 

25 Having identified a transcriptional and/or translational product of SEQ ID No 1 and/or SEQ 
ID No 5 as an etiological factor in B-CLL it is also within the scope of the present invention 
to perform an immunisation of a patient in need thereof against B-CLL, wherein the 
immunisation generates an immune response in the patient which recognises a 
translational product of SEQ ID No 2, SEQ ID No 4, SEQ ID No 6, SEQ ID No 7, SEQ ID No 

30 8, SEQ ID No 9, SEQ ID No 10 and SEQ ID No 11. A preferred immunotherapy is a 

vaccination against B-CLL by immunising an individual against a translational product of 
SEQ ID No 1 and/or SEQ ID No 5. In this way the individual builds up antibodies directed 
against said translational product and any developing B-CLL will be stopped by these 
antibodies. 

35 

Immunisation may be performed in various ways, such as by immunising said individual 
with at least one isolated polypeptide as defined the present invention and optionally 
adjuvants and carriers or immunising with an expression construct capable of expressing 
an isolated polypeptide according to the invention in the cells (DNA vaccination). 

40 

Another method comprises peptide loading of dendritic cells, or ex vivo expansion and 
activation of T-cells, or inducing a CTL response that targets cells expressing the 
polypeptide encoded by SEQ ID No 1 and/or SEQ ID No 5. 
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Antibodies 

Antibodies against any of the polypeptides belonging to the novel class of proteins 
identified by the present inventors can be produced by any known method of 
immunisation. 

In one embodiment, the antibodies are produced in a non-human mammal, or in an insect. 
If antibodies are to be used for therapy in human beings they are preferably subsequently 
humanised. In one embodiment, the antibody is formulated into a single-chain antibody. 

In another embodiment, in particular for therapeutic purposes, the host organism is a 
human being and the antibody is subsequently produced recombinantly in a non-human 
mammal, such as a mouse. The antibody may also be produced as a monoclonal antibody 
in a hybridoma. One way of producing a monoclonal antibody is described in US 5,681,729 
in which a human lymphocyte producing an antibody is generated by the steps, in the 
order mentioned, comprising 

1. transplanting human lymphocytes to a mouse lacking both functional T and B 
cells so that said human lymphocytes take in said mouse's body; 

2. immunizing said mouse with a desired antigen so as to generate human 
lymphocytes producing an antibody specific to said antigen; 

3. administering to said mouse an antiserum to mouse cells; 

4. recovering lymphocyte containing cells from said mouse; 

5. separating human lymphocytes from the recovered cells by centrifugation; and 

6. separating said human lymphocytes producing said antibody. 

7. immortalizing said human lymphocytes 

8. cloning the obtained immortalized human-derived lymphocytes producing said 
antibody; and 

recovering a monoclonal antibody specific to said desired antigen from the cloned 
immortalized human-derived lymphocytes. 

30 The antibodies of the present invention may be provided as part of a pharmaceutical 
composition. Such a pharmaceutical composition may be used for treating cancer, 
preferably for treating leukaemia, more preferably for treating B-CLL leukaemia, more 
preferably for treating poor prognosis B-CLL leukaemia. 

35 

Use of antibodies in therapy 

Antibodies directed against epitopes can be used for prevention and/or therapy of for 
example B-CLL. Antigenic epitopes can be used in vaccines to stimulate an immunological 
response in a mammal that is directed against cells having the B-CLL-associated epitope 
40 found in the AMB-1 protein(s) or functional equivalents. Antibodies directed against the 
antigenic epitopes of the invention can combat or prevent B-CLL. 

An antigenic epitope may be administered to the mammal in an amount sufficient to 
stimulate an immunological response against the antigenic epitope. The antigenic epitope 
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may be combined in a therapeutic composition and administered In several doses over a 
period of time that optimizes the immunological response of the mammal. Such an 
immunological response can be detected and monitored by observing whether antibodies 
directed against the epitopes of the invention are present in the bloodstream of the 
5 mammal. 

Such antibodies can be used alone or coupled to, or combined with, therapeutically useful 
agents. Antibodies can be administered to mammals suffering from any B-CLL that 
displays the B-CLL-associated epitope. Such administration can provide both therapeutic 
10 treatment, and prophylactic or preventative measures. For example, therapeutic methods 
can be used to determine the spread of a B-CLL and lead to its remission. 

Therapeutically useful agents include, for example, leukeran, adrimycin, 
aminoglutethimide, aminopterin, azathioprine, bleomycin sulfate, bulsulfan, carboplatin, 

15 carminomycin, carmusfme, chlorambucil, cisplatin, cyclophosphamide, cyclosporine, 
cytarabidine, cytosine arabinoside, cytoxin dacarbazine, dactinomycin, daunomycin, 
daunorubicin, doxorubicin, esperamicins, etoposide, fluorouracil, ifosfamide, interferon-cc, 
lomustine, melphalan, mercaptopurine, methotrexate, mitomycin C, mitotane, 
mitoxantrone, procarbazine HCI, taxol, taxotere (docetaxel), teniposide, thioguanine, 

20 thiotepa, vinblastine sulfate, vincristine sulfate and vinorelbine. Additional agents include 
those disclosed in Chapter 52, Antineoplastic Agents (Paul Calabresi and Bruce A. 
Chabner), and the introduction thereto, pp. 1202-1263, of Goodman and Gilman's "The 
Pharmacological Basis of Therapeutics", Eighth Edition, 1990, McGraw-Hill, Inc. (Health 
Professions Division). Toxins can be proteins such as, for example, pokeweed anti-viral 

25 protein, cholera toxin, pertussis toxin, ricin, gelonin, abrin, diphtheria exotoxin, or 
Pseudomonas exotoxin. Toxin moieties can also be high energy-emitting radionuclides 
such as cobalt-60, 1-131, 1-125, Y-90 and Re-186, and enzymatically active toxins of 
bacterial, fungal, plant or animal origin, or fragments thereof. 

30 Chemotherapeutic agents can be used to reduce the growth or spread of B-CLL cells and 
tumors that express the AMB-1 associated epitope of the invention. Animals that can be 
treated by the chemotherapeutic agents of the invention include humans, non-human 
primates, cows, horses, pigs, sheep, goats, dogs, cats, rodents and the like. In all 
embodiments human B-CLL antigens and human individuals are preferred. 

35 

Species-dependent antibodies can be used in therapeutic methods. Such a species- 
dependent antibody has constant regions that are substantially non-immunologically 
reactive with the chosen species. Such species-dependent antibody is particularly useful 
for therapy because it gives rise to substantially no immunological reactions. The species- 
40 dependent antibody can be of any of the various types of antibodies as defined above, but 
preferably is mammalian, and more preferably is a humanized or human antibody. 



WO 2004/046376 



PCT/DK2003/000794 



29 

Compositions 

Therapeutically useful agents can be formulated into a composition with the antibodies of 
the invention and need not be directly attached to the antibodies of the invention. 
However, in some embodiments, therapeutically useful agents are attached to the 
5 antibodies of the invention using methods available to one of skill in the art, for example, 
standard coupling procedures. 

Compositions may contain antibodies, antigenic epitopes or trypsin-like protease inhibitors. 
Such compositions are useful for detecting the AMB-1 protein (for example antigenic 
10 epitopes) and for therapeutic methods involving prevention and treatment of B-CLLs 
associated with the presence of the AMB-1 (for example antigenic epitopes). 

The antibodies, (and for example antigenic epitopes and protease inhibitors) can be 
formulated as pharmaceutical compositions and administered to a mammalian host, such 
15 as a human patient in a variety of forms adapted to the chosen route of administration. 
Routes for administration include, for example, intravenous, intra-arterial, subcutaneous, 
intramuscular, intraperitoneal and other routes selected by one of skill in the art. 

Solutions of the antibodies, (and for example antigenic epitopes and protease inhibitors) 
20 can be prepared in water or saline, and optionally mixed with a nontoxic surfactant. 

Formulations for intravenous or intra-arterial administration may include sterile aqueous 
solutions that may also contain buffers, liposomes, diluents and other suitable additives. 

The pharmaceutical dosage forms suitable for injection or infusion can include sterile 
25 aqueous solutions or dispersions comprising the active ingredient that are adapted for 
administration by encapsulation in liposomes. In all cases, the ultimate dosage form must 
be sterile, fluid and stable under the conditions of manufacture and storage. 

Sterile injectable solutions are prepared by incorporating the antibodies, antigenic epitopes 
30 and protease inhibitors in the required amount in the appropriate solvent with various of 
the other ingredients enumerated above, as required, followed by filter sterilization. 

Polynucleotides 

In a still further aspect the invention relates to an isolated polynucleotide selected from 
35 the group consisting of: 

i) a polynucleotide comprising nucleotides of SEQ ID No 5, 

ii) a polynucleotide encoding a polypeptide having the amino acid sequence of SEQ ID No 
3, 

40 iii) a polynucleotide, the complementary strand of which hybridises, under stringent 
conditions, with a polynucleotide as defined in any of i) and ii). 

iv) a polynucleotide which is degenerate to the polynucleotide of iii), and 

v) the complementary strand of any such polynucleotide. 
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The polypeptides encoded by the polynucleotides may furthermore 

a) have at least 60 % sequence identity with the amino acid sequence of SEQ ID No 3 
and have interleukin or cytokine activity, 

b) be recognised by an antibody, or a binding fragment thereof, which is capable of 

5 recognising an epitope, wherein said epitope is comprised within a polypeptide having 
the amino acid sequence of SEQ ID No 3; and/or 

c) be competing with a polypeptide having the amino acid sequence as shown in SEQ ID 
No 3 for binding to at least one predetermined binding partner such as a cytokine 
receptor. 

10 

Specific examples of fragments of SEQ ID No 1 include the nucleotide sequence selected 
from the group consisting of SEQ ID No: 12, SEQ ID No: 13, SEQ ID No: 14, SEQ ID No: 15, 
SEQ ID No: 16, SEQ ID No: 17 and SEQ ID No: 18. 

15 Hybridisation 

The entire nucleotide sequence of the coding sequence of SEQ ID No 1 and/or SEQ ID No 5 
or portions thereof can be used as a probe capable of specifically hybridising to 
corresponding sequences. To achieve specific hybridisation under a variety of conditions, 
such probes include sequences that are unique and are preferably at least about 10 

20 nucleotides in length, and most preferably at least about 20 nucleotides in length. Such 
probes can be used to amplify corresponding sequences from a chosen organism or 
individual by the well-known process of polymerase chain reaction (PCR) or other 
amplification techniques. This technique can be used to isolate additional nucleotide 
sequences from a desired organism or as a diagnostic assay to determine the presence of 

25 the coding sequence in an organism or individual. Examples include hybridisation screening 
of plated DNA libraries (either plaques or colonies; see e. g. Innis et al. (1990) PCR 
Protocols, A Guide to Methods and Applications, eds., Academic Press). 

The terms "stringent conditions" or "stringent hybridisation conditions" include reference to 
30 conditions under which a probe will hybridise to its target sequence, to a detectably 

greater degree than other sequences (e. g., at least twofold over background). Stringent 
conditions are target sequence dependent and will differ depending on the structure of the 
polynucleotide. By controlling the stringency of the hybridisation and/or washing 
conditions, target sequences can be identified which are 100% complementary to a probe 
35 (homologous probing). 

Alternatively, stringency conditions can be adjusted to allow some mismatching in 
sequences so that lower degrees of similarity are detected (heterologous probing). 

40 Generally, probes for hybridisation of this type are in a range of about 1000 nucleotides in 
length to about 250 nucleotides in length. 

An extensive guide to the hybridisation of nucleic acids is found in Tijssen, Laboratory 
Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid Probes, 
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Part I, Chapter 2, "Overview of principles of hybridization and the strategy of nucleic acid 
probe assays", Elsevier, New York (1993); and Current Protocols in Molecular Biology, 
Chapter 2, Ausubel, et al., Eds., Greene Publishing and Wiley-Interscience, New York 
(1995). See also Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual (2nd ed. 
5 Cold Spring Harbor Laboratory, Cold Spring Harbor, N. Y.). 

Specificity is typically the function of post-hybridisation washes, the critical factors being 
the ionic strength and temperature of the final wash solution. 

10 Generally, stringent wash temperature conditions are selected to be about 5°C to about 
2°C lower than the melting point (Tm) for the specific sequence at a defined ionic strength 
and pH. The melting point, or denaturation, of DNA occurs over a narrow temperature 
range and represents the disruption of the double helix into its complementary single 
strands. The process is described by the temperature of the midpoint of transition, Tm, 

15 which is also called the melting temperature. 

Formulas are available in the art for the determination of melting temperatures. 

Preferred hybridisation conditions for the nucleotide sequence of the invention include 
20 hybridisation at 42°C in 50% (w/v) formamide, 6X SSC, 0.5% (w/v) SDS, 100 mg/ml 
salmon sperm DNA. Exemplary low stringency washing conditions include hybridization at 
42°C in a solution of 2X SSC, 0.5% (w/v) SDS for 30 minutes and repeating. Exemplary 
moderate stringency conditions include a wash in 2X SSC, 0.5% (w/v) SDS at 50°C for 30 
minutes and repeating. 

25 

Exemplary high stringency conditions include a wash in 2X SSC, 0.5% (w/v) SDS, at 65°C 
for 30 minutes and repeating. Sequences that correspond to the AMB-1 gene or fractions 
thereof according to the present invention may be obtained using all the above conditions. 
For purposes of defining the invention, the high stringency conditions are used. 

30 

Mutations 

Finally, the invention provides a method for determining an increased or decreased 
predisposition for B-CLL comprising determining in a biological sample from an individual a 
germline alteration in a target nucleic acid sequence comprising 150,000 nucleotides, said 

35 target nucleic acid sequence comprising at least 10 nucleotides of SEQ ID No 1 and/or SEQ 
ID No: 5. This aspect is based on the finding of the importance of the expression product of 
SEQ ID No 1 and/ SEQ ID No:5, and the complete absence of any detectable expression 
product of SEQ ID No 1 and/or SEQ ID No:5 in healthy tissue and in patients with good 
prognosis B-CLL. It is highly likely that the difference is caused by a germline alteration. A 

40 germline alteration can be targeted by gene therapy methods and by the methods 
provided in the present invention. 

Preferably, said predisposition is a predisposition for poor prognosis of B-CLL. 
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Examples 

Example 1: cDNA cloning 

By Differential Display (Pardee et aL, 1992, J0rgensen et aL, 1999) part of a gene 
(hereafter referred to as AMB-1) was found that is expressed in unmutated B-CLL patients 
5 with poor prognosis. This gene is not found in the mutated B-CLL patients. When AMB-1 
was sequenced and aligned to known sequences in GenBank, perfect homology was found 
to 225 base pairs (bp) of human genomic DNA from chromosome 12. 

RNA was prepared using the RNeasy kit from Qiagen, as described by the manufactor 
10 (Qiagen, Hilden, Germany). RNA was prepared from patients with B-CLL without hyper 

mutation who, by PCR analysis, using primers FDP5 (CCTTTATGTGTGTGACAAGTG) and F10 
(ATCCAGCCAGGATGAAATAGAA), showed a high level of the resulting PCR fragment. Poiy- 

A + RNA was isolated from total RNA by the "MicroPoly(A)Purist" kit from Ambion, as 
described by the manufactor (Ambion, Inc., Texas, USA). Cloning-ready cDNA was 

15 prepared from 8 pg poly-A + RNA using the "ZAP Express® XR Library Construction Kit" 
from Stratagene as described by the manufactor (Stratagene, San Diego, USA). The cDNA 
was size fractionated and two size fractions (fraction-1: > 2500 bp and fraction-2 300- 
2500 bp) were independently ligated to pre-digested lambda Zap vectors and packed into 
phage particles as described by the manufacture (Stratagene, San Diego, USA). The titer 

20 was determined for each library and 200,000 pfu of from the fraction-1 library were plated 
onto two 22x22 cm screening plates (100,000 pfu on each plate) and 750,000 pfu of the 
fraction-2 library were plated on five 22x22 cm screening plates (150,000 pfu on each) as 
described by Stratagene, San Diego, USA. The plates were incubated at 37°C for 18 hours 
and the plaques transferred to replica nylon filters (Amersham) and denatured and 

25 renatured to allow hybridisation. All procedures were made as described by the 

manufactures (Stratagene, San Diego, USA & Amersham Biosciences, Buckinghamshire, 
UK). 

The filters were screened by independent hybridisations with alpha[ 32 P]-dATP-labelled 
30 DNA fragments; alpha[ 32 P]-dATP was purchased from Amersham Biosciences, 

Buckinghamshire, UK. Between succesive hybridisations, the old probe was removed by 
incubation for 20 min in 21 90-100°C water containing 0.1% SDS . The DNA fragments 
used as probes were (all positions relate to sequence ID # X): 1) pos. 48978-49250; 2) 
pos. 50011-51591; 3) pos. 51461-52182; 4) pos. 51901-52589; 5) pos. 53121-56521; 6) 
35 pos. 58163-59408. All hybridisations and washes were made according to the indtructions 
from Stratagene, San Diego, USA and Amersham Biosciences, Buckinghamshire, UK; 
Washing was done at a high stringency (0.1 x SSC at 65°C for 20 min). 

A total of 38 plaques that showed a positive response from one or more of the screenings, 
40 were excised from the screening plates and grown as plasmids as described (Stratagene, 
San Diego, USA). 
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A total of 8 cDNAs were identified by cDNA cloning or by a combination of cDNA cloning, 
PGR analysis and RACE (rapid amplification fo cDNA ends-polymerase chain reaction) using 
the the SMART RACE cDNA amplification kit (Clontech, Palo Alto, CA) according to the 
manufacturer's instructions. 



Example 2: Bioinformatic analysis of AMB-1. 

The DIMA and protein sequence data bases (GenBank and EBI) have been searched for 
sequences with similarity to AMB1. There is no significant DNA sequence similarity to any 
known gene. In particular, the coding region of the AMB1 mRNA (SEQ ID no 3) is not 
present in any known EST. The only significant match to the complete mRNA sequences 
and the DNA sequence of the putative coding region were BAC clones derived from the 
region on human chromosome 12 where the gene is located. The "AMB-1 gene" had not 
been annotated as a gene on the chromosome. Searches with the peptide sequence in the 
sptrnr data base of peptide sequences (includes Sprot and nrtrembl) showed a low 
similarity to putative intron maturases from cloroplasts and to bovine IL4. The percentage 
similarity to both maturases and bovine IL4 was low (25.6% and 30.3%, respectively) and 
the similarity to maturases only included a match to 75 amino acids of the much larger 
maturases. In contrast, the match to bovine IL4 extended over the full peptide sequence. 
IL4, and other 4-helical cytokines, include a leader peptide sequence (signal peptide) 
allowing the proteins to be secreted. The AMB1 peptide sequence includes a N-terminal 
peptide sequence with similarity to signal peptide sequences, however, it is not a typical 
sequence. 

A 3D search has been performed, where a peptide sequence is searched for similarity to 
known protein or peptide 3D-structures. The two best matches were the thioredoxin fold 
and the human 4-helical cytokine IL4 (Fig. 5). The two matches had almost similar 
probability scores (2.88 and 3.05, respectively). Searches with 4-helical cytokine peptide 
sequences (IL4, IL3, IL13 and GM-CSF) revealed that all could be folded into both a 4- 
helical cytokine structure and the thioredoxin fold. Alignment based on the structural 
similarity between IL4 (dliara) and AMB-1 is shown in Fig. 5. Thus, the AMB1 peptide 
sequence share this property with 4-helical cytokines. The structural similarity is not 
perfect (Fig. 6) and there are no obvious glycosylation sites in the AMB1 sequence, 
however, the similarity is significant. Alignment of the AMB1 peptide sequence with the 
sequences of IL4, IL3, IL13 and GM-CSF, based on their structures, showed very little 
sequence conservation but a high degree of structural conservation (Fig. 7). Based on this 
alignment, AMB1 has similarities to all the 4-helical cytokines, and the length of AMB1 and 
the position of gaps in the alignment could suggest a higher similarity to e.g. IL13, but 
searches at 3D-PSSM only identified a significant similarity to the structure of IL4, not 
IL13, IL3 or GM-CSF. However, the search algorithms are not perfect and may therefore 
not detect a possible low structural similarity. 
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Example 3: Differential expression of AMB-1 



Patient material 

Blood samples were collected from newly diagnosed untreated patients with B-CLL. 
5 Mononuclear cells were isolated by Lymphoprep separation (Nycomed Pharma, Oslo, 
Norway), and the percentage of CD5+CD20+ B-CLL cells in the mononuclear fraction was 
>90% in all samples as determined by flow cytometric analysis. 

Isolation of RNA and conversion to cDNA. 

10 Material for RNA production was isolated mononuclear cells from B-CLL patients or 

mononuclear cells from lymphoprep separated buffy coats from normal donors. Total RNA 
was isolated from 5xl0 7 or more cells using the QIAamp RNA Blood Mini kit (Qiagen, 
Valencia, CA) with DNAse treatment. RNA (lug) was converted to cDNA by incubation with 
a mixture of random-primers (lpg) and T24-primer (Ipg) for 5 minutes at 70°C. After 

15 cooling on ice, the reaction mixture was added to a final volume of 25pl containing 30U of 
AMV Reverse Transcriptase HC (Promega, Madison, WI f USA), lx First Strand Buffer 
(50mM Tris-HCI, pH 8.3, 50mM KCI, lOmM MgCI 2 , lOmM DTT, 0.5mM spermidine), 2.5mM 
of each dNTP and 60U rRNasin ribonuclease inhibitor (Promega, Madison, WI, USA). The 
reaction was performed for 60 minutes at 37°C. 

20 

Determination of somatic hypermutation status 

Two pi of cDNA was amplified using a GeneAmp PCR System 2700 (Applied Biosystems, 
Warrington, UK) with a 40 pmol specific upstream primer corresponding to 1 of the 6 
human VH family leader sequences (VH1: 5'-CCATGGACTGGACCTGGAGG-3', VH2: 5'- 

25 ATGGACATACTTTGTTCCAGC-3', VH3: 5'-CCATGGAGTTTGGGCTGAGC-3', VH4: 5'- 
ATGAAACACCTGTGGTTCTT-3', VH5: 5'-ATGGGGTCAACCGCGATCCT-3', VH6: 5'- 
ATGTCTGTCTCCTTCCTCAT-3') and a 40 pmol downstream primer (Cp:5'- 
GAGGCTCAGCGGGAAGACCTT-3' or Cy:5'-GGGGAAGACCGATGGGCCCCT-3') corresponding 
to a consensus sequence of the constant region of IgM or IgG respectively. The Reverse 

30 Transcription (RT)-PCR reaction contained IxPCR buffer (lOmM Tris-HCI, pH 9.0, 50mM 
KCI, 0.1% Triton X-100), 2.5mM MgCI 2 , 0.2mM of each dNTP and 1.5U Taq DNA 
Polymerase (Promega, Madison, WI, USA) in a final volume of lOOpl. The RT-PCR was 
performed under the following conditions: 1 cycle of 94°C for 5 minutes, 30 cycles of 
denaturation at 94°C for 30 sees, annealing at 62°C for 30 sec. and extension at 72°C for 

35 30 sec, and a final extension at 72°C for 7 minutes. The RT-PCR products were analysed 
on 2% agarose gels and sequenced in an HBI Prism 310 Genetic Analyzer (Perkin Elmer, 
Foster City ,CA, USA) using the BlgDye Terminator Cycle Sequencing Ready Reaction kit 
(Applied Biosystems, Warrington, UK) following the manufacturer's instructions. 

40 Sequences obtained from each sample were compared to germ line sequences in the V 
base sequence directory (I.M. Tomlinson, MRC Center for Protein Engineering, Cambridge, 
UK) using BLAST, and the closest germ line sequence was assigned. A gene sequence was 
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considered to be mutated if it had equal or more than 2% sequence alterations when 
compared to the closest published germ line sequence. 

RT-PCR that amplifies the Exon 2-Exon 3 junction 
5 To evaluate the mRNA expression pattern of AMB1 in unmutated and mutated B-CLL 
patients RT-PCR was performed. Exon-overlapping oligonucleotide primers were: 5'- 
ATCCAGCCAGGATGAAATAGAA-3' and 5'-CACTTGTCACACACATAAAGG-3'. The RT-PCR was 
performed in a GeneAmp PCR System 2700 thermal cycler with an Initial denaturation at 
94°C for 2 minutes, 40 cycles of 96°C for 25 sec, 62°C for 25 sec. and 72°C for 90 sees, 

10 and a final extension at 72°C for 5 minutes. The reactions contained 2pl cDNA, lx DDRT- 
PCR buffer (lOmM Trls-HCI, pH 8.3, 50mM KCI, 1.8mM MgCI 2/ 0.1% Triton X-100, 0.005% 
gelatine), 0.25mM of each dNTP, 30 pmol of each primer and 0.5U Taq DNA Polymerase 
(Promega, Madison, WI, USA) in a 30pl final volume. RT-PCR products were analyzed by 
gelelectrophoresis on 2% agarose gels and visualized with a Gene Genius Bio Imaging 

15 System (Syngene, Frederick, MD) after staining with ethidium bromide. An actin control 
RT-PCR was performed using the primers: 5'-TGACGGGGTCACCCACACTGTGCCCATCTA-3' 
and 5'-CTAGAAGCATTTGCGGTGGACGATGGAGGG-3'. 

RT-PCR that amplifies the Exon 1-Exon 3 junction: 

20 To evaluate the mRNA expression pattern of AMB1 in unmutated and mutated B-CLL 
patients RT-PCR was performed. Exon-overlapping oligonucleotide primers were: 5'- 
AGACGGCTCTCACC AATAAG-3' and 5 / -CACTTGTCACACACATAAAGG-3 , . The RT-PCR was 
performed in a GeneAmp PCR System 2700 thermal cycler with an initial denaturation at 
940C for 2 minutes, 40 cycles of 96°C for 25 sec, 62°C for 25 sec. and 72°C for 90 sees, 

25 and a final extension at 72°C for 5 minutes. The reactions contained 2^ cDNA, lx DDRT- 
PCR buffer (lOmM Tris-HCI, pH 8.3, 50mM KCI, L8mM MgCI 2 , 0.1% Triton X-100, 0.005% 
gelatine), 0.25mM of each dNTP, 30 pmol of each primer and 0.5U Taq DNA Polymerase 
(Promega, Madison, WI, USA) in a 30^il final volume. RT-PCR products were analyzed by 
gel electrophoresis on 2% agarose gels and visualized with a Gene Genius Bio Imaging 

30 System (Syngene, Frederick, MD) after staining with ethidium bromide (Continental Lab 
Products, San Diego, USA). 

Statistical analysis 

Statistical significance of the correlation between somatic hypermutation status and AMB1 
35 expression was analyzed using Fisher's exact test. 

Northern blotting. RNA from spleen, bone marrow and colon was purchased from 
Clontech. The AMB1 probe was an 896 base pair fragment (57661-56766) obtained by RT- 
PCR as described above with the primers 5'-TCACCTGGGAGCTCAGAGGA-3' and 5'- 
40 GTGATCCTGGGAGAATCTCT-3'. For Northern blotting, 5 pg of RNA was run on a 1% 
agarose-gel with 6% formaldehyde dissolved in 1 x MOPS (20 mM 3-(N-morpholino)- 
propane-sulfonic acid, 5 mM sodium acetate, 1 mM EDTA, pH 7.0) for size separation. The 
presence of equal amounts of RNA in each lane was ensured by ethidium bromide staining. 
The RNA was transferred to a Hybond-N membrane (Amersham, Little Chalfont, UK) by 
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capillary blotting and fixed by UV-irradiation. The filters were pre-hybridized for 1-2 hours 
at 42°C in 6 ml ULTRAhyb (Ambion, Austin, TX, USA) preheated to 68°C and hybridized 
overnight at 42°C after addition of further 4 ml containing the 32 P-labeled probe and 
sheared salmon sperm DNA (10 pg/ml). The membranes were washed for 2 x 15 min. at 
5 42°C in 2 x SSC, 0.1% SD5 followed by 1 x 15 min. in 0.2 x SSC 7 0.1 % SDS and 2 x 15 
min. in 0.1 x SSC, 0.1 % SDS at 42°C. The blot was developed and quantified by a 
phosphoimager. The sizes of the mRNAs were determined by reference to 18S and 28S 
ribosomal RNA, which were visualized by ethidium bromide staining. The AMB1 probe used 
for hybridization was radiolabeled with [<x- 32 P] dCTP using the Random Primers DNA 
10 Labeling System (Gibco BRL). 

Dot blot of multiple tissue expression (MTE) array. An MTE array (Clontech, Palo 
Alto, CA, USA) was hybridised to AMB1 at 65°C in ExpressHyb (Clontech) supplemented 
with sheared salmon sperm DNA (7.5 pg/ml) and human C 0 t-1 DNA (1.5 pg/ml) according 

15 to the manufacturers recommendations. The tissue types represented on the MTE array 
are shown in Figure 11. Following hybridisation the filter was washed 5 x 20 min. at 65°C 
in 2 x SSC (1 x SSC =150 mM NaCI, 15 mM sodium citrate, pH 7.0), 1% SDS and 2 x 20 
min at 65°C in 0.1 x SSC, 0.5% SDS. The blot was developed and quantified by a 
phosphoimager (Fuji Imager Analyzer BAS-2500, Image Reader ver. 1.4E, Image Gauge 

20 ver. 3.01 software, Fuji, Stockholm, Sweden). The membranes were stripped by boiling in 
0.5% SDS for 10 min. before rehybridization. The probe used for hybridization were 
radiolabeled with [a- 32 P] dCTP using the Random Primers DNA Labeling System (Gibco 
BRL, Rockville, ML, USA). 

25 Results 

Based on the known sequence of the AMB-1 cDNA RT-PCRs with primers that extend 
across the Exon 2-Exon 3 junction and the Exon 1- Exon 3 junction were set up. As shown 
in Figure 10, where the Exon 2- Exon 3 junction has been amplified, AMB-1 is expressed in 
30 the unmutated patients (UPN1-8) while no expression of AMB-1 is seen in mutated 
patients (UPN9-16). 

Northern blot analysis was performed to determine the size of AMB-l's mRNA transcript. 
As shown in Figure 3 the probe identifies transcripts in the samples from the three patients 

35 without somatic hypermutation (UPN1, UPN4 and UPN7). However, the probe does not 
recognise any transcripts from the patients with somatic hypermutation (UPN9, UPN10, 
UPN13, UPN21) or the various cell lines and tissue samples. Similar results were obtained 
when cell lines and tissue samples were investigated by RT-PCR (results not shown). Dot 
blot analysis on a purchased filter with 96 different RNA samples (Figure 8) only revealed 

40 specific binding to the total DNA control dot, but not to any specific tissue (results not 
shown). Thus AMB-1 is only expressed in B-CLL cells without hypermutation or AMB-1 is 
expressed at extremely low levels in other tissues. 
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We next tested the predictive value, in terms of IgV H mutational status, of expression of 
AMB-1 in 29 consecutive newly diagnosed patients. At present 13 somatically unmutated 
and 16 somatically mutated patients have been included in our prospective patient 
database. The sensitivity and specificity for expression of AMB-1 In predicting mutational 
5 status is well above 90% (p<0.0001), which is at the level obtained by sequencing. 



Example 4. Investigation of the prognostic significance of AMB-1 in terms of patient 
survival 

10 To obtain information about the ability of AMB-1 to predict survival or time to progression 
for B-CLL patients survival curves were made. For each patient the following data were 
collected: clinical stage at time of diagnosis (Rai and Blnet staging), date of diagnosis, date 
of first time of treatment and last follow up date. Time to treatment (progression free 
survival) and survival time were calculated based on these dates using the Microsoft Excel 

15 software. Survival times and progression free survival times were plotted be the Kaplan- 
Meier Method and compared using the log-rank test (Prism 3.0 Graph Pad software). 

34 newly diagnosed, untreated B-CLL patients were investigated as described above. As 
shown in Figure 2a-d AMB-1 expression is a good predictor of B-CLL patient survival and 
20 B-CLL patient time to progression. 

Example 5: siRNA assays. 

Materials and Methods: 

Frozen cells from unmutated or mutated B-CLL patients pruified and characterized for 
mutational status as previously described were thawed and Ficoll separated to obtain the 

25 live cells. Cells were counted and resuspended at 2xl0 6 cells/ml in RPMI 1640 medium 
with glutamax-1 (RPMI, GIBCO, Paisley, UK). 1x10 s cells (500^1 of cell suspension) were 
incubated on ice for 10 minutes with the appropriate siRNA or combination of siRNAs at a 
final concentration of lOOnM each. The suspension was electroporated using a 4mm 
cuvette (Molecular BioProducts, San Diego, CA, USA) in a BioRad Gene Pulser. Following 

30 electroporation the cells were incubated on ice for 10 minutes, washed once in RPMI with 
20% FCS and lOOunits/ml Penicillin and 100ng/ml Streptomycin (GIBCO, Paisley, UK), 
resuspended at a concentration of 4xl0 6 cells/ml in the same medium and incubated for 
48 hours. For each patient the optimal voltage allowing for maximum siRNA uptake was 
determined by making a voltage curve (300-450V, 960tiF) using a FITC-labeled unspecific 

35 probe (Xeragon, Qiagen, Hiiden, Germany) at a final concentration of 200nM and the 
uptake was followed by flow cytometry. 

The siRNAs were produced using the SilencerTM siRNA Construction Kit, (Ambion, Texas, 
USA) according to the manufacturers instructions. SiRNAs had the following target 
40 sequences: 



siRNA name: 



mRNA target sequence: 



< 
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Exon 1 A 


5 f - AAUAAGGGCAGGCAUCAUCCA -3' 


SEQ ID No: 19 


Exon 1 B 


5'- AAUUACACUGCCAGGUUUCCU -3' 


SEQ ID No:20 


Exon 2 A 


5 1 - AAUUCAUUCACAAUGAUUGCU -3 


SEQ ID No:21 


Exon 2 B 


5 ' - AAUUUCUCUUGGGUAAUUCAG-3 1 


SEQ ID No:22 


Exon 3 (DDend) A 


5'- AAAAUCAGAAUCUGCGCAGCA -3' 


SEQ ID No:23 


Exon 3 (DDend) B 


5'- AAUGAU GAU G G G AAG AAG GAA -3 ? 


SEQ ID No:24 


CDS A 


5'- AAACUUAGUAAUUGAGUGUGA 3' 


SEQ ID No:25 


CDS B 


5 f - AAUAUGUCACUUUCAUAAAGC -3 1 


SEQ ID No: 26 


Transcript on -strand A 


5'- AAU GAUGAU GG GAAG AAGGAA -3' 




Transcript on -strand B 


5'- AAACUAUGAGAUUUCAGAAGG -3 r 





Table 1: Overview of the siRNAs used in electroporation experiments 



After 48 hours of incubation the various B-CLL samples were counted and live and dead 
5 cells were distinguished by nigrosin exclusion (0.1% in Nigrosin in PBS from Fluka 7 Buchs, 
Switzerland). 

Results: 

Following 48 hours of incubation with control GFP siRNA or siRNAs against various regions 
10 of the cDNAs (see table 1), the viable cells and dead cells were counted based on the 
ability of the cells to exclude nigrosin. As shown in table 2, in 3 out of 4 patients 
introduction of siRNAs against Exon 1, Exon 2 and Exon 3 resulted in an increase of dead 
cells as compared to the controls (no siRNA or GFP siRNA). 



Treatment: 


UPN 67 


UPN 66 


UPN 62 


UPN 73 


No siRNA 


33.0% 


19.7% 


30.9% 


20.2% 


Control siRNA 


28.4% 


26.7% 


24.0% 


16.0% 


CDS siRNA 


23.7% 


25.5% 


21.0% 


13.3% 


Exl-3 siRNA 


47.3% 


54.1% 


17.2% 


29.1% 



Table 2: Percent dead cells of total cells following electroporation and incubation with 
siRNAs. 

20 Example 6: Identification of possible cytogenetic aberrations near or within the region 
encoding AMB-1 on chromosome 12. 

Rationale: The limited expression profile of AMB-1 suggests that it may be a result of a 
genetic aberration (e.g. deletion, translocation or alternative splicing) or that the promoter 
region controlling the expression of AMB-1 is uniquely activated in unmutated B-CLL. 
25 Another gene is situated about 200.000 bases upstream of the AMB-1 gene (SEQ ID No 1) 
on chromosome 12 and the inventors we have determined that this gene is expressed at 
equal levels in unmutated and mutated patients. 
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Methods: Using primers, initially spaced about 20.000 bp apart; this region on 
chromosome 12 is characterised in unmutated B-CLL patients. If genetic aberrations within 
the region are detected by PCR analysis of chromosomal DNA, detailed molecular genetic 
5 studies using FISH, microsatellite analysis and Southern blotting will be employed. The 
whole region from unmutated patients is sequenced. 

Example 7: Polyclonal antibodies 

10 

Production of polyclonal antibodies: 

Synthetic peptides CDLETNSEINKLIIYLFSQNNRIRF and CQVSKKHIIYSTFLSKNF were 
synthesized and conjugated to KLH (KJ.Ross-Petersen Aps, Holte, Denmark). Polyclonal 
antibodies were produced by immunization of rabbits with these conjugated peptides by 
15 DAKO (DAKO Cytomation A/S, Glostrup, Denmark). 

Testing of polyclonal antibodies: 

At the present time we have produced polyclonal antibodies from three rabbits that have 
been immunized with peptides representing predicted immunogenic regions of the protein 

20 that can be predicted from the CDS sequence (SEQ ID No: 17 ) (cDNA 4). The antibodies 
are tested in various ways. The proposed reading frame of CDS (SEQ ID No: 17) is 
expressed in 293 cells and the binding of antibodies to lanes on a western blot with non- 
transfected 293 ceils versus transfected 293 cells are compared. The size of the band in 
the lanes with transfected 293 cells is compared to the size of western blot bands in lanes 

25 with proteins from B-CLL patients. Specificity of the bands is secured by peptide blocking 
experiments. 

Additionally the polyclonal antibodies are tested in B-CLL immunoprecipitation experiments 
where the antibodies are used to immunoprecipitate the protein produced from the CDS 
30 sequence and the immunoprecipitates are analyzed by western blotting. 
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Example 8: Assay for the biological activity of 4-helix cytokines. 

The assay is based on the use of a cytokine dependent or stimulated cell line, for example 
an IL4 dependent cell line ("Optimisation of the CT.h4S bioassay for detection of human 
5 interleukin-4 secreted by mononuclear cells stimulated by phytohaemaglutinin or by 
human leukocyte antigen mismatched mixed lymphocyte culture", Petersen, S.L., Russell, 
C.A., Bendtzen, K. & Vindel0v, L.L., Immunology Letters 84 (2002) 29-39). Other 
examples of cytokine dependent cell lines include IL13 dependent cell lines. A list of 
commercially available cytokine dependent cell lines is disclosed in the general part of the 
10 description. These can all be used for assessing cytokine activity. The most preferred cell 
lines are those that are IL4 dependent. 

The assay can be performed in two ways. The first assay comprises providing 
recombinantly produced AMB1 protein or a functional equivalent thereof and determine the 
15 proliferation rate of the cell line. The proliferation rate (either rate of proliferation or ± 
proliferation) can be compared to the proliferation rate of the cell line exposed to IL4 or 
another known 4-helical cytokine or interleukin. 

If a positive result is obtained with a polypeptide an assay will be performed on the same 
20 cell line with the IL4 receptor blocked. This will check whether the stimulus goes through 
IL4R. 

The second assay is based on transfection of a gene encoding a 4-helix cytokine according 
to the invention into cytokine dependent cells and observe proliferation or non-proliferation 
25 during transient expression. 

Example 9: Cytokine receptor binding assays 

The following is a description of the layout of a cytokine receptor binding assay used to 
determine the cytokine activity of the 4-helix cytokines according to the present invention. 

30 

The assays can be performed with any cytokine receptor. Preferred receptors include but is 
not limited to the receptors for IL4 IL13, IL3, and GM-CSF. 

The ability of recombinant cytokine receptor to bind to 4-helical cytokine is assessed in a 
35 competitive binding ELISA assay as follows. Purified recombinant cytokine receptor (IL4, 
IL13, IL3 or GM-CSF receptors) (20 pg/ml in PBS) is bound to a Costar EIA/RIA 96 well 
microtiter dish (Costar Corp, Cambridge Mass., USA) in 50 pL overnight at room 
temperature. The wells are washed three times with 200 pL of PBS and the unbound sites 
blocked by the addition of 1% BSA in PBS (200 pl/well) for 1 hour at room temperature. 
40 The wells are washed as above. Biotinylated AMB-1 (1 pg/ml serially diluted in twofold 
steps to 15.6 ng/mL; 50 pL) is added to each well and incubated for 2.5 hours at room 
temperature. The wells are washed as above. The bound biotinylated AMB-1 is detected by 
the addition of 50 pl/well of a 1:2000 dilution of streptavidin-HRP (Pierce Chemical Co., 
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Rockford, III.) for 30 minutes at room temperature. The wells are washed as above and 50 
pL of ABTS (Zymed, Calif.) added and the developing blue color monitored at 405 nm after 
30 min. The ability of unlabeiled 4-helical cytokine to compete with biotinylated AMB-1, 
respectively, is assessed by mixing varying amounts of the competing protein with a 
5 quantity of biotinylated AMB-1 shown to be non-saturating (i.e., 70 ng/mL; 1.5 nM) and 
performing the binding assays as described above. A reduction in the signal (Abs 405 nm) 
expected for biotinylated 4-helical cytokine indicates a competition for binding to 
immobilised cytokine receptor. 

10 The above identified assays can be used to identify 4-helical cytokines with similar binding 
affinities as AMB-1 (SEQ ID No. 3). In the competitive binding assays biotinylated IL4, 
IL13, IL3, or GM-CSF can be used to identify 4-helical cytokines which can compete with 
these cytokines. 
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